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USE OF HEPARANASE TO IDENTIFY AND ISOLATE ANTI-HEPARANASE 
CXDMPOUND 
FIELD OF INVENTION 
The present invention discloses the use of mammalian heparanase, preferably 
5 recombinant heparanase, for screening for anti-heparanase compounds. More particularly, the 
present invention provides a method of selecting IHA (Inhibitors of Heparanase Activity). In 
addition, the present invention provides a purified heparanase, particularly suitable for use to 
identify and isolate anti-heparanase compounds as well as for other known uses of l^paranases. 
such as its use to accelerate wound healing or its use as an immobilized heparanase filter 
10 connected to extracorporeal devices to degrade heparin and neutralize its anticoagulant properties 
during surgery. 

BACKGROUND OF THE INVENTION 
Elevated heparanase activity has been documented in mobile, invasive ceUs. Examples 
include; invasive melanoma, lymphoma, mastocytoma, mammary adenocarcinoma, leukemia, 
15 and rheumatoid fibroblasts. This activity has also been documented in non-pathologic 

situations involving the migration of lymphocytes, neutrophils, macrophages, eosinphils and 
platelets. An inhibitor of heparanase would therefore broadly influence the invasive potential of 
these divCTse cells. 

Inhibition of heparan sulfate degradation would also inMbit the release of bound growth 
20 factors and other biologic response modifiers that would, if released, fiiel the growth of adjacent 
tissues and provide a supportive environment for cell growth (Rapraeger, et al.. Science 252: 
1705-1708, 1991). Inhibitors of heparanase activity would be of value in the treatment of 
arthritis, vasciilar restenosis, tumor growth and progression, and fibro-proliferative disorders. 

Until now, the obstacles to designing a screening assay to find inhibitors of mammalian 
25 heparanase have been the unavailability of a mammalian h^aranase that is purified to apparent 
homogeneity and the lack of information about the amino acid sequence or the 3-dimensional 
structure of the enzyme. Without the amino acid sequence, it has not been possible to produce 
recombinant mammalian heparanase to be used in large volume screening efforts. Knowledge 
of the tertiary and quaternary structures would facilitate rational design of IHA. This report 
30 overcomes obstacles relating to the sequence of the heparanase, and also provides a model for 
higher-order structure. 

Heparanase refers to a manmialian en;^me which can degrade heparin proteoglycans 
(HPG) and/or heparan sulfate proteoglycans (HSPG). 

Heparanase activity in mammalian cells is well known. It is found in various melanoma 
35 cells (Nakajima, et al.. Cancer Letters 31: 277-283, 1986), mammary adenocarcinoma cells 
(Parish, et al.. Int. J. Cancer, 40: 511-518, 1987), leukemic cells (Yahalom, et aL, Leukemia 
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Research 12: 711-717, 1988), mast ceUs (Ogren and Lindahl, /. Bio/. CW 250: 2690-2697, 
1975), macrophages (Savion,e. a/., /.CW/.P/.)'s£oZ.. 130: 85-92, 1987), mononiK:lear cells 
(Sewell, et al.. Biochenu J. 264: 777-783, 1989), neutrophUs (Matzner. et al., J. Leukocyte 
Biology 51: 519-524. 1992). T-ceUs (Vettel, et al., Eur. J. Immunol 111 2247-2251, 1991). 
5 platelets (Haimovitz-Friedman, et al. Blood 78: 789-796, 1991), eadoAelial ceUs (Godder, et 
al., J. CeU Physiol 148: 274-280, 1991). and placenta (Klein and von Figura. BBRC 73: 
569. 1976). 

WO 91/02977, incorporated herein by reference, describes a substantially, but partially, 
purified heparanase produced by cation exchange resin chromatography and the affmity 
10 absorbent purification of heparanase-containing cell extract. WO 91/02977 also describes a 
method promoting wound healing utilizing compositions comprising a "purified" form of 



15 



Others have proposed the use of purified bacterial heparanase, immobilized onto fdters 
and connected to extracorporeal devices, to degrade heparin and neutralize its anticoagulant 
properties post surgery (Freed, et al, Anru Biomed. Eng. 21: 67-76, 1993). 

U.S. Patent 4,882,318 describes heparanase-mhibiting compositions for preventing tumor 
metastasis. 

Haimovitz-Friedman et al {Blood 78: 789-796. 1991) describe an assay for heparanase 
acuvity that involves the culturing of endothelial cells in radiolabeled ^^SO^ to produce 
20 radiolabeled heparan sulfate proteoglycans, the removal of the cells which leaves the deposited 
extraceUular matrix that contains the ^Ss-HSPG. the addition of potential sources of heparanase 
activity, and the detection of possible activity by passing the supernatant firom the radiolabeled 
extracellular matrix over a gel filtraUon column and monitoring for changes of the size of the 
radiolabeled material that would indicate that HSPG degradation had taken place. Hiis assay 
25 does not have the capabiUty for large-scale screenmg of inhibitors. 

U^kB.yinaetal.iAnal Biochenu 196: 162-171. 1986) describe a soUd-phase substrate 
for the assay of melanoma heparanase activity. Heparan sulfate from bovine lung is chemically 
radiolabeled by reactmg it with [»*q-acetic anhydride. Free amino groups of the [^'^CJ-heparan 
sulfate were acetylaled and the reducmg termini were aminated. THe [l^C]-heparan sulfate was 
30 chemicaUy coupled to an agarose support via the introduced amine groups on the reducing 
terminL Hiis substrate is Umiied in that it is an extensively chemicaUy modified form of 
naturally occinring heparan sulfate. 

Khan and Newman (Anai.BiocAem. 196: 373-376, 1991) describe an indirect assay for 
heparanase activity, to this assay, heparin is quantitated by its abiUty to interfere with the color 
35 development between a protein and the dye Coomassi brilliant blue. Heparanase activity is 
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detected by the loss of this interference. This assay is limited in use for screening because it is 
so indirect that other non-h^arin compounds could also interfere with the protein-dye reaction. 

The CXC chemokine family (also called the intencrine a family) is one branch of the 
supergene "intmrine" cytokine family (Oppenheim, Ann. Rev. Bic^^ ?.m. 9: 617-648, 1991). 
5 It's members include platelet factor 4, platelet basic protein and derivatives, "^-10, gro(a,p,y), 
NAP-l/interleukin-8, mig, and ENA-78 (for review, see Miller and Krangel. Critical Reviews in 
Immunology 12: 17-46, 1992). The other branch, the CC chemokines or intCTcrine-3 fanuly, 
includes MlPla, MIPipjE/MCP-1 . RAhrreS, and MCAF. All membCTs of both branches of 
this chemokine family characteristic^y are basic heparin-binding polypeptides, display 

10 molecular weights betwerai 8 and 1 1 kD, share 20 - 50% homology, and fiinction broadly in 
pathologic situ?~^on£ characterized by inflammation and tissue remodeling. 

The proteolytically processed forms of piglet basic protein include CTAP-in, P- 
thromboglobulin, and NAP-2. P-thromboglobulin (Moore, et al, Biochim. Biophys. Acta. 379: 
360-369. 1975) and CTAP-ffl (Castor, et al.. Arthritis Rheum. 20: 859-868, 1977), were 

15 originally isolated firom activated supematants or lysates from outdated pit: ets. Using the 

techniques of subcellular fractionation and radioimmunoassay, p-thromboglobultn was identified 
as an a-granule protein that could be released upon activation (Kaplan, et al.. Blood 53: 604- 
618, 1979). Platelet basic protein itself was later isolated from fresh platelets, megakaryocytes, 
and HEL cells, an immortal human erythroleukemia cell line (Holt, et al.. Biochemistry 25: 

20 1988-1996, 1986; Holt, et al., Exp. Hematol. 16: 302-306, 1988). Walz and Baggiolini 
isolated the processed form of NAP-2 from, platelet-containing cultures of stimulated 
mononuclear cells (Walz, et al., J. Exp. Med. 170: 1745-1750. 1989). 

Material labeled as p-thromboglobulin is commracially available from Calbiochem, San 
Diego, CA (Cat # 605165). Olsus Laboratories. Cincinnati, OH (Cat # 41705). and 

25 Haematologic Technologies, Essex JcL, VT (CaL # HBTG-02100. Hie invaitors have 

determined, by using the "Purifrcation Assay," that the commercial preparation have heparanase 
activity at a level of 0.075 units/pg. Hiis activity is below the level of 1 unit/jig needed for the 
screening of anti-heparanase compounds in accordance with the assay of the subject invention. 
U.S. Patent 4,897,348 describes recombinant materials and methods for producing 

30 human connective tissue-activating peptide-Hl (CTAP-m) and analogs thereof. 

Transglutaminases catalyze the posttranslational modifrcation of proteins by 
transamidation of available glutamine residues. This action results primarily in the formation of 
epsUon-(gamma-glutamyl)ly sine cross-links (Greenberg, er aZ., FASEB J. 5: 3071-3076. 1991). 
This posttranslational modification has been reported to dramatically alter the action of some 

35 small proteins. ■ For exairq)Ie. a transglutaminase produces a glutamine-lysin cross-link in the 13 
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kD phospholipase A2 and increases its specific enzymatic activity (Cordella-Miele, et al., J. 
Biol. Chem. 265: 17180-17188, 1990). A transglutaminase cross-links another small molecule, 
interIeukin-2, and converts its activity to one that is cytotoxic to mature oligodendrocytes (Eitan 
and Schwartz, Science 261: 106-108,1993). The glutamine-lysine cross-link in a protein woxdd 
5 result in the loss of overall positive charge for that protein. The transglutaminases are optimally 
active and generally used under reducing conditions such as dithiothreitol. The concept that 
glutamine-lysine cross-linking alters the activity of these small proteins may be applicable to 
other small molecules as well. 

SUMMARY OF THE INVENTIQN 

10 The present invration discloses a method of screening for compotmds having anti- 

heparanase activity (AHA coit^unds), i.e. inhibitors of hepziamse activity (IHA), comprising 
the steps of: contacting a potential AHA compound with radiolabeled heparin/heparan sulfate 
and hepaxanase for a time and under such conditions sufficient to allow for inhibition of 
heparanase activity; detecting inhibition of hepaianase activity; and selecting compounds that 

15 inhibit heparanase activity. The present invention also discloses the amino acid sequence 
identity of the h^aranase that has been purified to homogeneity by chromatography wada 
reducing conditions. Identification of the amino acid sequence of the protein ^^liich contains 
hepaianase activity is crucial for the production of recombinant mammalian h^aranase. 

DETAILED DESCRIPTION OF THE INVENTION 

20 The present invention provides a pinified heparanase. and a method for producing it. 

The heparanase so produced has an activity of greater than 20 mats/ pg protein, preferably 
greater than 50 units heparanase activity per ]»g protein (1 unit =1% cpm < 30 K/hr using the 
'Purification Assay" (Example 2, Part D). 

In addition, the present invention provides recombinant heparanase and a means for 

25 producing it The term "purified heparanase" as used in the specification and claims includes 
the recombinant heparanase as described in the subject application. The recombinant heparanase 
of the subject invention can be used for the same purposes and in the same manner as the 
purified heparanase. 

The purified heparanase of the present invention has an isoelectric point of less than 5.5 
30 (preferably about 4.8 - 5.1) and preferably is activated by treatment with transglutanunase using 
reducing conditions. 

The recombinant heparanase of the present invention has an isoelectric point of less than 
5.5 (preferably about 4.8 - 5.1), and is isolated under reducing conditions and is activated by 
treatment with transglutaminase. 
35 Suitable transglutaminases that may be used for this purpose include Activated Factor 

Xnia, guinea pig liver transglutaminase, epidermal transglutaminase, kwatinocyte 
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transglutaminase, and tissue transglutaminase. 

The heparanase of the present invention has the amino acid sequence (SEQ. ID. NO: 1) 

ofi 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu. Asp Set Asp Leu. Tyr Ala 
5 5 10 15 

Glu Leu Arg Cys Met Cys lie Lys Thr Thr Ser Gly lie His Pro Lys 
20 25 30 

10 Asn ne Ghi Ser Leu Glu Val De Gly Lys Gly Thr His Cys Asn Ghi 
35 40 45 

Val Glu Val He Ala Thr Leu Lys Asp Gly Aig Lys He Cys Leu Asp 
50 55 60 

15 

Pro Asp Ala Pro Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly 
65 70 75 80 



Asp Glu Ser Ala Asp 
20 85 

encoded by the cDNA sequence (SEQ ID NO: 2) of: 

1 AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA 
51 ACrCCGCTGC ATGTGTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA 
101 TCCAAAGTTT GGAAGTGATC GGGAAAGGAA CCCATTGCAA CCAAGTCGAA 
25 151 GTGATAGCCA CACTGAAGGA TGGGAGGAAA ATCTGCCTGG ACCCAGATGC 
201 TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT GATGAATCTG 
251 CTGAT 

which conesponds to the cDNA sequence and derived amino acid sequ^ce of CTAP-HI. See 
Wenger et al.. Blood, 73: 1498-1503. 1989. 
30 In another aspect, the present invention provides a h^aranase having the amino acid 

sequence (SEQ ID NO: 3) of: 

Ser Ser Thr Lys Gly Ghi Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 
5 10 15 

35 Glu Sex Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys He 
20 25 30 

Lys Thr Thr Ser Gly He His Pro Lys Asn He Ghi Ser Leu Glu Val 
35 40 45 

40 

He Gly Lys Gly Thr His Cys Asn Ghi Val Glu Val He Ala Thr Leu 
50 55 60 



Lys Asp Gly Arg Lys He Cys Leu Asp Pro Asp Ala Pro Arg He Lys 
45 65 70 75 80 
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Lys ne Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp 
85 90 

encoded by the cDNA sequence (SEQ ID NO: 4) of: 

1 TCCrCCACCA AAGGACAAAC TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA 

5 51 AAGTCTAGAC AGTGACTTGT ATGCTGAACr CCGCTGCATG TGTATAAAGA 

101 CAACCTCTGG AATTCATCCC AAAAACATCC AAAGTTTGGA AGTGATCGGG 

151 AAAGGAACCC ATTGCAACCA AGTCX5AAGTG ATAGCCACAC TGAAGGATGG 

201 GAGGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG AAAATTGTAC 

251 AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 

10 which coireqrands to the cDNA sequence and derived amino acid sequence of platelet basic 

protein. See Wenger et al.. Blood, 73: 1498-1503, 1989 as well as Walz and Baggiolini, BBRC 

159: 969-981, 1989; Castor, era/., BB/?C 163: 1071-1078,1989. 

In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 5) ofi 

15 Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys 
15 10 15 

Met Cys lie Lys Thr Thr Ser Gly He His Pro Lys Asn lie Ghi Ser 
20 25 30 

20 

Leu Glu Val De Gly Lys Gly Thr His Cys Asn Gin Val Glu Val He 
35 40 45 

Ala Thr Leu Lys Asp Gly Arg Lys De Cys Leu Asp Pro Asp Ala Pro 
25 50 55 60 

Arg ne Lys Lys De Val Gbi Lys Lys Leu Ala Gly Asp Glu Ser Ala 
65 70 75 80 

30 Asp 

encoded by the cDNA sequence (SEQ ED NO: 6) of. 

1 GGCAAAGAGG AAAGTCTAGA CAGTGACITG TATGCTGAAC TCCGCTGCAT 
51 GTGTATAAAG ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTIGG 
101 AAGTGATCGG GAAAGGAACC CATTGCAACC AAGTCGAAGT GATAGCCACA 

35 151 CTGAAGGATG GGAGGAAAAT CTGCCTGGAC CCAGATGCTC CCAGAATCAA 
201 GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT GAT 
which corresponds to the cDNA sequence and derived amino acid sequrace p-thromboglobulin 
OTG). See Wenger et al., BlooH, 73: 1498-1503, 1989. 

In another aspect, the present invention provides a heparanase having the amino acid 

40 sequence (SEQ ID NO: 7) of. 
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Glu Leu Arg Cys Met Cys He Lys Thr Thr Ser Gly He His Pro Lys 
15 10 15 

Asa He Gin Ser Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin 
5 20 25 30 

Val Glu Val He Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp 
35 40 45 

10 Pro Asp Ala Pro Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly 
50 55 60 

Asp Glu Ser Ala Asp 
65 

15 

encoded by the cDNA sequence (SEQ JD NO: 8) of: 

1 GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA 
51 CATCCAAAGT TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG 
101 AAGTGATAGC CACACTGAAG GATGGGAGGA AAATCTGCCT GGACCCAGAT 
20 151 GCTCCCAGAA TCAAGAAAAT TGTACAGAAA AAATTGGCAG GTGATGAATC 
201 TGCTGAT 

which corresponds to the cDNA sequence and derived amino acid sequence of neutrophil 
activating peptide-2. 

Hie foregoing amino acid sequences conespond to the products of a single gene caUed 

25 platelet basic protem (Walz and Baggiolini, BBRC 159: 969-981, 1989; Castor, et al.. BBRC 
163: 1071-1078, 1989). The complete gene sequence of platelet basic protein is weU known. 
See, for exanq)le. Wenger et al.. Blood, 73: 1498-1503, 1989 and Proc. Natl Acad. ScL USA, 
90, 3660-3664, 1993. 

The present invention also provides heparanase having the amino add sequences of 

30 other memb^s of the CXC chemokine family [including Platelet factor 4 (SEQ. ID NO. 12), 
yW-lO (SEQ. ID NO. 14), gro/MGSA (SEQ. ID NO. 16), gro-p/MIP-2a (SEQ. ID NO. 18), 
gr (SEQ. ID NO. 20), Interleukin-8/NAP-l (SEQ. ID NO. 22) and ENA-78 (SEQ. ID 

NO. 24)] as weU as members of the CC chemokine fanuly [including MlP-la (SEQ. ID NO. 
26), MIP-lp (SEQ. H) NO. 28), 1-309 (SEQ. ID NO. 23), MCP-1 (SEQ. ID NO. 32), MCP-3 

35 (SEQ. ID NO. 34), RANTES (SEQ. ID NO. 36), fic (SEQ. ID NO. 38) and MCP-2 (SEQ. ID 
NO. 40)1; purified to apparent homogeneity, prepared in the presence of reducing conditions, 
and activated by treatment with transglutaminase. Suitable transglutaminases that may be used 
for this purpose include Activated Fartor XHIa, guinea pig liver transglutaminase, epidermal 
transglutaminase, keratinocyte transglutaminase, and tissue transglutaminase. 

40 In another aspect, the present invention provides a heparanase having the amino acid 
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sequence (SEQ ID NO; 12) of: 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe Leu Gly Leu Leu 
Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu Ala Glu Glu Asp Gly Asp Leu Gin 
Cys Leu Cys Val Lys Thr Thr Ser Gin Val Arg Pro Arg His He Thr Ser Leu Glu Val 
5 He Lys Ala Gly Pro His Cys Pro Thr Ala Gin Leu lie Ala Thr Leu Lys Asn Gly Arg 

Lys He Cys Leu Asp Leu Gin Ala Pro Leu Tyr Lys Lys De Be Lys Lys Leu Leu Glu 
Ser 

encoded by the cDNA sequence (SEQ ED NO: 13) of: 

1 GCGCAGCATG AGCTCCGCAG CCGGGTTCTG CGCCTCACXJC CXZCGGGCTGC 
10 51 TGTTCCTGGG GTIXSCTGCTC CTGCCACITG TGGTCGCCTT CGCCAGC 

101 GAAGCTGAAG AAGATGGGGA CCTGCAGTGC CTGTGTGTGA AGACCACCTC 
151 CCAGGTCCGT CCCAGGCACA TCACCAGCCT GGAGGTGATC AAGGCCGGAC 
201 CCCACTGCCC CACTGCCCAA CTGATAGCCA CGCTGAAGAA TGGAAGGAAA 
251 ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 
15 301 TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC 

351 ATACTTCnr TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT 
401 TTGTGTTATC dTATGATTT TAAATAAACA AAATAAATC 
which corresponds to the cDNA sequence and derived amino acid sequence of platelet factor 4. 
See Poncz et aL, Blood 69, 219-223 (1987). 
20 In axMibsx a^iect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 14) of. 

Met Asn Ghi Thr Ala De Leu Be Cys C^s Leu Be Phe Leu Thr Leu Ser Gly lie Ghi Gly 
Val Pro Leu Ser Aig Thr Val Arg Cys Thr Cys lie Ser He Sct Asn Gin Pro Val Asn Pro 
Val Asn Pro Arg Ser Leu Glu Lys Leu Glu He De Pro Ala Ser Gin Phe Cys Pro Arg 
25 Val Glu He lie Ala Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro Glu Ser 

Lys Ala De Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met Ser Lys Arg Ser fto 
encoded by the cDNA sequence (SEQ ID NO: 15) of: 

1 GAGACATTCC TCAATTGCIT AGACATATTC TGACJCCTACA GCAGAGGAAC 
51 CrCCAGTCTC AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA 
30 101 TCnrCTGAC TCTAAGTGGC ATTCAAGGAG TACCTCTCrC TAGAACCGTA 

151 CGCTGTACCT gc:atcagcat TAGTAATCAA CCTGTTAATC CAAGGTCTTT 
201 AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTrGTCCA CGTGTTGAGA 
251 TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 
301 TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA 
35 351 AAGATCTCCT TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG 

401 GATGGACCAC ACAGAGGCTG CCTCTCCCAT CACITCOCTA CATGGAGTAT 
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451 ATGTCAAGCC ATAATTGTTC TTAGTTTGCA GTrACACTAA AAGGTGACCA 
501 ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA GGTTAATGTT 
551 CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 
601 GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA 
which corresponds to the cDNA sequence and derived amino acid sequence -^-10. See Luster 
et al.. Nature 315, 672-676 (1985). 

In another aspect, the present invration provides a heparanase having the amino acid 
sequence (SEQ ID NO: 16) of: 

Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu 
Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala Ala Gly Ala Ser Val Ala Thr GIu 
Leu Arg Cys Ghi Cys Leu G]n Thr Leu Gin Gly lie His Pro Lys Asn lie Gbi Sa Val 
Asn Val Lys Ser Pro Gly Pro His Cys Ala Gki Thr Glu ^ , He Ala Thr Leu Lys Asn 
Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro lie Val Lys Lys lie He Glu Lys Met 
Leu Asn Ser Asp Lys Ser Asn 
encoded by the cDNA sequence (SEQ ID NO: 17) of; 

1 CTCGCCAGCT OrrcCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC 
51 CCCATGGCCC GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT 
101 GCGAGTGGCA CTGCTGCTCC TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG 
151 CAGGAGCGTC CGTGGCCACr GAACTGCGCT GCCAGTGCTT GCAGACCCTG 
201 CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAAOGTGA AGTCCCCCGG 
251 ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 
301 AAGCITGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG 
351 ATGCTGAACA GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC 
401 TGGTGGCTGT TCCTGAAGGA GGCCCTGCCC TTATAGGAAC AGAAGAGGAA 
451 AGAGAGACAC AGCTGCAGAG GCCACCTGGA TTGTGCCTAA TGTGTITGAG 
501 CATCGCITAG GAGAAGTCTT CrATTTATTT ATTTATTCAT TAGTITTGAA 
551 GATrCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 
601 CTACCTGCAC ACTGTCCTAT TATATTCATT CmTTGAAA TGTCAACCCC 
651 AAGTTAGTrC AATCTGGATT CATATTTAAT TTGAAGGTAG AATGTnTC X 
701 AATGTTCrCC AGTCATTATG TTAATATTTC TGAGGAGCCT GCAACATGCC 
751 AGCCACTGTG ATAGAGGCTG GCGGATCCAA GCAAATGGCC AATGAGATCA 
801 TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA ACTGTTTAGA 
851 TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 
901 TTTCrCATGT TGAAACITTA AGAACTAAAA TGTTCTAAAT ATCCCTTGGA 
951 CATTTTATGT CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT 
1001 ACAGTGTTTC TGGCTTAGAA CAAAGGGGCT TAATTATTGA TGTTTTCGGA 
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which coiresponds to the cDNA sequence and derived amino acid sequence of gro/MGSA 

(melanoma growth stimulatory activity). See Anisowicz et al., Proc. Nati. Acad. Sci. U.S.A. 

84, 7188-7192 (1987). 

In another aspect, the present invention provides a heparanase having the amino acid 

sequence (SEQ ID NO: 18) of: 

Met Ala Arg Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu Arg Val Ala Leu 
Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Pro Lys Ala TTir Glu 
Lys Arg Cys Ghi Cys Lys Gbi Thr Leu Gin Gly De ffis Leu Lys Asn De Gin Ser Val 
Lys Val Lys Ser Pro Gly Pio His Cys Ala Gin Thr Glu Val He Ala Thr Leu Lys Asn 
Gly Ghi Lys Ala Cys L^u Asn Pro Ala Ser Pro Met Val Lys Lys lie lie Glu Lys Met 
Leu Lys 

encoded by the cDNA sequraice (SEQ ID NO: 19) of: 

1 CTcrccrccr cgcacagccg ctcgaaccgc ctgctgagcc ccatggcccg 

51 CGCCACGCrC TCCGCCGCCC CCAGCAATCC CCGGCTCCTG CGGGTGGCGC 
101 TGCTGCTCCr GCTCCTGGTG GCCGCCAGCC GGCGCGCAGC AGGAGCGCCC 
151 CrGGCCACTG AACTGCGCTG CCAGTGCTTG CAGACCCTGC AGGGAATTCA 
201 CCrCAAGAAC ATCCAAAGTG TGAAGGTGAA GTCCCCCGGA CCCCACTGCG 
251 CCCAAACCGA AGTCATAGCC ACACTCAAGA ATGGGCAGAA AGCITGTCrC 
301 AACCCCGCAT CGCCCATGGT TAAGAAAATC ATCGAAAAGA TGCTGAAAAA 
» 351 TGGCAAATCC AACTGACCAG AAGGAAGGAG GAAGCITAIT GGTGGCrGTT 
401 CCTGAAGGAG GCCCTGCCCT TACAGGAACA GAAGAGGAAA GAGAGACACA 
451 GCIGCAGAGG CCACCTGGAT TGCGCCTAAT GTGTTTGAGC ATCACITAGG 
501 AGAAGTCTTCTATrrATTrATITATTrATTTATTTGTITGTnTAGAAGA 
551 TTCTATGITA ATATTITATG TGTAAAATAA GGTTATGATT GAATCTACIT 
5 601 GCACACrCrCCCATrATATTTATrGTITATTITAGGTCAAACCCAAGTrA 

651 GTrcAATCCrGATrCATATTTAATTTGAAGATAGAAGGTTTGCAGATATT 
701 cnrrAGTCATTTCTrAATATTrcTTCGTGATGACATATCACATGTCAGCC 
751 ACTGTCATAG AGGCTGAGGA ATCCAAGAAA ATGGCCAGTG AGATCAATGT 
801 GACGGCAGGG AAATGTATGT GTGTCrATIT TGTAACTGTA AAGATGAATG 
0 851 TCAGTrGTTATTTATTGAAATGATrrCACAGTGTGTGGTCAACATTrCTC 
901 ATC1TGAAGC nTAAGAACT AAAATGTTCT AAATATCCCT TGGACATnT 
951 ATGTCnrCT TGTAAGGCAT ACrGCCTTGT TTAATCTTAA TTATGCAGTG 
1001 TTTCCCrCTG TGTTAGAGCA GAGAGGTITC GATATTTATT GATGITTTCA 
1051 CAAAGAACAG GAAAATAAAA TATITAAAAA T 
J5 which corresponds to the cDNA sequence and derived ammo acid sequence gro-P^MIP-2a 

(macrophage inflammatory protein 2-a). See Tekamp-Olson et al.. J. Exp. Med. 172, 911-919 
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(1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 20) of. 

Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asa Pro Arg Leu Leu Arg Val Ala Leu 
5 Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala Ala Gly Ala Ser Val Val TTir Glu 

Leu Arg Cys Ghi Cys Leu Gin Thr Leu Gin Gly lie His Leu Lys Asn lie Ghi Ser Val 
Asn Val Aig Ser Pro Gly Pro His Cys Ala Ghi Thr Glu Val De Ala Thr Leu Lys Asn 
Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Ghi Lys De lie Glu Lys lie Leu 
Asn Lys Gly Ser Thr Asn 
10 encoded by the cDNA sequence (SEQ ID NO: 21) oft 

1 CrCGCACAGC TTCCCGACGC GTCTGCTGAG CCCCATGGCC CACGCCACGC 
51 TCTCCGCCGC CCCCAGCAAT CCCCGGCrCC TGCGGGTGGC GCTGCTGCTC 
101 CTGCTCCTGG TGGCCGCCAG CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC 
151 TGAACTGCGC TGCCAGTGCT TGCAGACACT GCAGGGAATT CACCTCAAGA 
15 201 ACATCCAAAG TGTGAATGTA AGGTCCCCCG GACCCCACTG CGCCCAAACC 

251 GAAGTCATAG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC 
301 ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA 
351 CCAACTGACA GGAGAGAAGT AAGAAGCITA TCAGCGTATC ATTGACACTT 
401 CCTGCAGGGT GGTCCCTGCC CTTACCAGAG CTGAAAATGA AAAAGAGAAC 
20 451 AGCAGCTTTC TAGGGACAGC TGGAAAGGAC TTAATGTGTT TGACTATTTC 
501 TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG CTTGTATTTT 
551 AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTr CATCAAACAT 
601 AGCrCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTr AAATGTGTCA 
651 TTAAACTAAT ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA 
25 701 TGACAGGGTG GGGAACTGGA GGGTGGGGGG ATTGAAATGC AAGCAATTAG 
751 TGGATCACTG TTAGGGTAAG GGAATGTATG TACACATCTA TTTnTATAC 

801 TmrmTA aaaaaagaat gtcagttgtt atttattcaa attatctcac 

851 ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATnTATGT 
901 CTTGCrTGTA GGGCATAATG CClTGnTAA TGTCCATTCT GCAGCGTTTC 
30 951 TCTTTCCCTT GGAAAAGAGA ATTTATCATT ACTGTTAC 

which corresponds to the cDNA sequence and derived amino acid sequence gro-'V/MIP-2P " 
(macrophage inflammatory protem 2-p). See Tekamp-Olson et al., J. Exp. Med. 172, 911-919 
(1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
35 sequence (SEQ ID NO: 22) of. 

Met Thr Ser Lys Leu Ala Val Ala Leu Leu Ala Ala Phe Leu He Ser Ala Ala Leu Cys 
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Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu Arg Cys Gin Cys lie Lys Thr Tyr 
Ser Lys Pro Phe His Pro Lys Phe He Lys Glu Leu Arg Val lie Glu Ser Gly Pro ffis Cys 
Ala Asn Thr Glu He He Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro Lys 
Glu Asn Trp Val Gin Arg Val Val Glu Lys Phe Leu Lys Arg Ala Glu 
encoded by the cDNA sequence (SEQ ED NO: 23) of: 

1 ATGACTTCCA AGCTGGCCGT GGCTCrCTTG GCAGCCTTCC TGATTTCTGC 
51 AGCTCTGTGT GAAGGTGCAG TTTTCCCAAG GAGTGCTAAA GAACTTAGAT 
101 GTCAGTGCAT AAAGACATAC TCCAAACCTT TCCACCCCAA ATTTATCAAA 
151 GAACTGAGAG TGATTGAGAG TGGACCACAC TGCGCCAACA CAGAAATTAT 
201 TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC AAGGAAAACT 
251 GGGTGCAGAG GGTTCTGGAG AAGTTnTGA AGAGGGCTGA G 
which corresponds to the cDNA sequence and derived amino acid sequence Interleukin-8/NAP-l 
(neutropha activating protein-1). See Kunser et al.. Kidney InL 39, 1240-1248 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 24) ofi 

Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu Gin Thr Thr Ghi 
Gly Val His Pro Lys Met lie Ser Asn Leu Gin Val Phe Ala lie Gly Pro Gin Cys Ser 
Lys Val Glu Val Val Ala Ser Leu Lys Asn Gly Lys Glu He Cys Leu Asp Pro Glu Ala 
Pro Phe Leu Lys Lys Val He Gin Lys He Leu A^ Gly Gly Asn Lys Glu Asn 
encoded by the cDNA sequence (SEQ ID NO: 25) of: 

1 GTGTTGCGGG AACTGCGGTG CGTGTGTITA CAGACCACGC AGGGAGTTCA 
51 TCCCAAAATG ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT 
101 CCAAGGTGGA AGTGGTACJCC TCCCTGAAGA ACGGGAAGGA AATTTGTCTT 
151 GATCCAGAAG CCCCmTCT AAAGAAAGTC ATCCAGAAAA TCCTCGACGG 
201 CGGCAACAAA GAAAAC 
which conesponds to the cDNA sequence and derived amino acid sequence of a novel 
inflanunatoty peptide (ENA-78) with homology to interleukin 8. See Walz et al., J. Exp. Med. 
174, 1355-1362 (1991). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 26) of: 

Met Gin Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Thr Met Ala Leu Cys Asn Ghi 
Val Leu Ser Ala Pro Leu Ala Ala Asp Thr Pro Thr Ala Cys Cys Phe Ser Tyr Thr Ser 
Arg Gin lie Pro Gin Asn Phe lie Ala Asp Tyr Phe Glu Thr Ser Ser Gfai Cys Ser Lys 
Pro Ser Val lie Phe Leu Thr Lys Arg Gly Arg Ghi Val Cys Ala Asp Pro Ser Glu Glu 
Trp Val Ghi Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala 
encoded by the cDNA sequence (SEQ ID NO: 27) of: 
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1 GAATTCAAGG CCTGTCCTGG 1TTGGTCCCA ATTTACCTTT ATCATCCATA 
51 TTCACCCCCA CTGCTCTGCA GCTCCACTGA AGCACCCCCT CnTCCTCTG 
101 AGCCACAATG TCACACCCAG GACTCTGCCT CAGCTGGGCC TCCACTGCCX: 
151 ACCCATCTAT AGATGCCTAA ATCCCGGGCA GTTATCCAGA CACAACTAAA 
5 201 GTTCCATCCC TTCCATGAAG CCTTCCCCAA CCCTCTGGTG GAAGGTCACT 

251 TCTTCCTCAT GGGGTTCTGA GCTTTCATTT CTTTTTCTAC TAAGAGTTTT 
301 ACAATTACCT GTTCATACAC TCTACCTGCC CCCATGAGAC CAGGGGCATC 
351 TCAGAAACAA AGATCATTAA AACCAACTAA ATCTATTTCT CATTATAAAA 
401 TGAGATATGC TGATTGATTG CAAAATAATA AAATAACAAA GTATGGAAAA 

10 451 GAAAAAAAAA AGCATATAAT CTGGCTGAGA AGGTAGAGAC CCTTCCACAC 

501 CACTGAAATT ATGTGTTGAA AAGAATAAGG AAAAAACTGC TTCAGTTTGG 
551 CATTATTTAT GTAAGTATAG TATAGGATCC TTAAAATGGT TCAAAGAAAT 
601 GGGAAATCAA GACTTCATIT TGGCAAAGCC ATTGAACAGA AACTGTAGCA 
651 TATTTATCAG TAATTTCTTr CAGATTAAAC AACTGACAAC AACCCACm 

15 701 TCAACCAGTG ATGTTGGAAA TGTTTTAAAA CAAAATTAGT TCATAAATTT 

751 GTGGGTTGAC CAAGAAGGTA ATAAAGTCTC ACTAAATAAA ATGAGGAAAA 
801 TTCAGAAAAA GAAAAAAATA AGAAAATAAA TCACCCATGG ATCTAAGCAC 
851 TATTCATTCT TTAAGGCATG TATTTCCAAG CCmTAATT TTTTCATGCC 
901 TAGAGTTGGC ATGGCATATA TATATCTTTA TACAATTCIT CAAATTTTAT 

20 951 AGAATTTGTA TAATGTTTTA TCTTGCl'lTr I'lTlTAACCA CTGATGTTAT 

1001 AAGCATATTT ATGCCACTTC ATTCACGTTA GAGACTTAAT AATAAAGGAT 
1051 CTTGTGGATA ATTTATCATT CCCTGATAGA GAAAAATTTA GCnTGCTTA 
1101 TTTTAGAGTr ATAAATGATG CIXKK3TCAGG TATCTTTATG Tl^ 
1151 GCTCCATATT TGGGTTGTTT CCACAGAACT CTTTCCAGAA ATGCTTTITC 

25 1201 TAGGTTAATG GCTACACATA TTTCTAGGCA CCTGACATAC TGACACCCAC 
1251 CTCTAAAGTA TnTTATGAT CCACAACTAG CGTTTAACAC AGCGCCCCAG 
1301 TCACTCCGAG ACTAATAAAT AGACAAATGA CTGAAACGTG ACCTCATGCT 
1351 TICTATTCCT rcAGCTTTCA TTGAGTTCCT TTCC^^ 

1401 GTTGTCTAGC CCTCCACAGC ATCAGCCCAT TGACCCTATC CITGTGGTTA 
30 1451 TAGCAGCTGAGGAAGCAGAATT/ AGCTCrGTGGGAAGGA ATGGGGCTGG 
1501 AGAGTTCATG CATAGACCAA TTClTmTT TnTTTTnT TGAGATGGAG 
1551 TTTCACTTTT GTTGCCCAGG CTGGAGTGCA ATGGCATGAT CTCAGCTCAC 
1601 CACAGCCCCC ACXrTCCTGGG TTCAAGCGAT TCTCCTGCCC TCAGCCTCCX: 
1651 GAGTAGCTGG GATTACAGGC ATGTGCCACC ACGCCTGACT ACnTTGTAT 
35 1701 TTTTAGTAGA GATGGAGrrr CrCXTTCITG GTCAGGTTGG TCTCAAACTC 
1751 CTGACCTCAG GTGATCTGCA GCCTCGGCXTT CCAAAGTGTT GGGATTACAG 
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1801 GTGTGAGCGA CCATGCCTGG CTGCATAGAC CAGTTCTTAT GAGAAGGGAT 
1851 CAACTAAGAA TAGCCTTGGG TTGACACACA CCCCTCITCA CACTCACAGG 
1901 AGAAACCCCA TGAAGCTAGA ACCAGTCATG AGTTGAGAGC TGAGAGTTAG 
1951 AGAGTAGCTC AGAGATGCTA TTCTTGGATA TCCTGAGCCC CTGTGGTCAC 
2001 CAGGGACCCT GAGTTGTGCA ACACTCAGCA TGACAGCATC ACTACACTTA 
2051 AAAATTTCCC TCCTCACCCC CAGATTCCAT TTCCXrCATCC GCCAGGGCTG 
2101 CXTTATAAAGA GGAGAGATGG CTTCAGACAT CAGAAGGACG CAGGCAGCAA 
2151 AGAGTAGTCA GTCCCTTCTT GGCTCTGCTG ACACTOGAGC CCACATTCCA 
2201 TCACCTGCrC CCAATCATGC AGGTCTCCAC TGCTGCXXTT GCXXjTCCTCC 
2251 TCTGCACCAT GGCTCTCTGC AACCAGGTCC TCTCTGCACC ACGTGAGTCC 
2301 ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT AGACCACATC . 
2351 AGTCnrnT TGTGGCGTGA GAGGCCCCGA AGAGAAAAGA AGGAAGTTCT 
2401 TAAAGCX5CTG CCAAACACXTT TGGTCmTT CITCACAACT TTTATmTA 
2451 TCTCTAGAAG GGGTCTTAGC CCTCCTAGTC TCCAGGTATG AGAATCTAGG 
2501 CAGGGGCAGG GGAGTTACAG TCCCTTGTAC AGATAGAAAA ACAGGGTTCA 
2551 AAACGAATCA GTTTGCAAGA GGCAGAATCC AGGGCTGCTT ACTTCCCAGT 
2601 GGGGTCTGTT CTTCACrCTC CAGCTCACCC TAGTCTCXX:A GGAGCCCTGT 
2651 CCCITGGATG TCTTATGAGA GATGTCCAGG GCTTCTCITG GGCTGGGGTA 
2701 TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTCC 
2751 ATTCAGGTAT CTGGATCACA TAGAGAAACA GAGAACXXAC TATGAAGAGT 
2801 CAAGGGGAAA GAGGAATATA GACAGAAACA AAGAGACATT TCTCTGCAAA 
2851 ACCCCCCAAA TGCXTTGCAG TCACTTGGTC TGAGCAAGCC TGCCCTCCTC 
2901 AACCACTCAG GGATCAGAAG CTGCCTGGCC TnTCITCTG AGCTGTGACT 
2951 TGGGCTTATT CTCTCCnTC TCCGCAGTTG CTGCTGACAC GCCGACCGCC 
3001 TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA 
3051 CTACTTTGAG ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC 
3101 CAGTCTTCCr GCTCACCTCT AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA 
3151 GAAACAGGCC AGAAGGCCAT CCTGGAAAGG CCCAGCCTTC AGGAGCCTAT 
3201 CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT TGGGGCTGGA 
3251 GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCTCAGG CCAGAGGAAA 
3301 GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCrGG 
3351 AGTCACTGAG AGAAGCTCTC TAGACXjGAGA TAGGCAGGGG GCCCCTGAGA 
3401 GAGGAGCAGG CCTTGAGCTG CCCAGGACAG AGAGCAGGAT GTCAGGGCCA 
3451 TGGTGGGCX:C AGGATTCCCX: GGCTGGATTC CCCAGTGCTT AACTCTTCCT 
3501 CCCTTCTCCA CAGCTTCCTA ACXAAGAGAG GCCGGCAGGT CTGTGCTGAC 
3551 cxx:agtgagg AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC 
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3601 CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG 
3651 GGAGGAGCAG GAGCCTGAGC CTTGGGAACA TGCGTGTGAC CTCCACAGCT 
3701 ACCrcrrCTA TGGACTGGTT ATTGCCAAAC AGCCACACTG TGGGACTCrr 

3751 cttaacttaa atntaattt atttatacta ntagttttt ataatttatt 
3801 tttgatttca cagtgtgttt gtgattgttt ggtctg. • gag TTCCCCCTGT 

3851 CCCCrCCACC TTCCCTCACA GTGTGTCTGG TGACAACXTGA GTGGCTGTCA 
3901 TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT 
3951 GTATCAGATG CmTGTTCA GGGCTGTGAT CGGCCTGGOr AAATAATAAA 
4001 GATGTTCTTT TAAACXK5TAA ACCAGTATTG AGTrTGGTTr TGTmTCrG 
4051 GCAAATCAAA kTCACTGGTT AAGAGGAATC ATAGGCAAAG ATTAGGAAGA 
4101 GGTGAAATGG AGGGAAATTG GGAGAGATGG ggagcgctgc gacagagtta 
4151 TCCACTTCAC AAAATTCTGG AACATTGAAA CTACGAATAT GTTATAACTC 
4201 AAATCGTAAT ATGCACGCTC TAGGAGAATT AACTACTTGA ATGGCCACC 
4251 TTAAGCAGAG TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT 
4301 AGCAATTATT TACATTGTTG AGGCTTACTC CTCCTACTGA GTGCmTTA 
4351 TACATTGTTC ATTTAATCTT ACCAATGCAA TAGTACAGCT TAGGTACTAT 
4401 TAATACCrCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA AAGGTTAGAC 
4451 AACTTGGCrG AGGITACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 
4501 TGGACmTA TTGAACTACA GACTATGCTG TTAACCATTG AOCAAGTTAT 
4551 TTCCCAAAGT ATGACCCGCC TATAGTCAAA TCTTACCCCA TTCTTTAACA 
4601 GATGATACTT TATCCATTGC AACCACTTCC TGTCAGGATT CTGAGTTGAC 
4651 ATAGAGTGTT TCAGCAGTGA TTATTTAAGC CAATTACATC AGGATCTTTA 
4701 GGTGTAGACC TGGGAACTGA TATnTTATC AAGCTCATGA GGTGTTCCAT 
4751 AGCATGTTAA TGACTGAGAG CCACTGTCAA TAGAATTC 
which coiresponds to the cDNA sequence and derived amino acid sequence MlP-la 
(macrophage inflammatory protein 1-a). See Blum et al.. DNA Cell Biol. 9, 589-602 (1990). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 28) oft 

Met Lys Leu Cys Val Hir Val Leu Ser Leu Leu Met Leu Val Ala Ala Phe Cys Ser Pro 
Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr Ala Cys Cys Phe Ser Tyr Thr Ala 
Arg Lys Leu Pro Axg Asn Phe Val Val Asp Tyr Tyr GIu Thr Ser Ser Leu C^s Ser Ghi 
. Pro Ala Val Val Phe Ghi Thr Lys Arg Ser Lys Gin Val Cys Ala Asp Pro Ser Glu Ser 
Tip Val Gin GIu Tyr Val Tyr Asp Leu Glu Leu Asn 
encoded by the cDNA sequence (SEQ ID NO: 29) of: 

1 TTCCCCCCCC CCCCCCCCCC CCCCGCCCGA GCACAGGACA CAGCTGGGTT 
51 CTGAAGCTTC TGAGTTCTGC AGCGTCACCT CTGAGAAAAC CTCmTCCA 
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101 CCAATACCAT GAAGCTCTGC GTGACTGTCC TGTCTCTCCT CATGCTAGTA 

151 GCTGCCTTCr GCTCTCCAGC GCTCTCAGCA CCAATGGGCT CAGACCCTCC 

201 CACCXJCCTGC TGCmTCTT ACACCGCGAG GAAGCTIXXT CGCAACTTTG 

251 TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA 

301 TTCCAAACCA AAAGAAGCAA GCAAGTCTGT GCTGATCCCA GTGAATCCTG 

351 GGTCCAGGAG TACGTGTATG ACXHXKjAACT GAACTGAGCT GCTCAGAGAC 

401 aggaagtcit cagggaaggt cacctgagcc cggatgcttc tcx:atgagac 
451 acatctcctc catactcagg actcctctcc gcagttcctg tgccttctct 
501 taatttaatc tttrttatgt gccgtgttat tgtattaggt gtcatttcca 
551 ttatttatat tagtttagcc aaaggataag tgtcctatgg ggatggtcxta 
601 crgtcactgt ttctctgctg ttgcaaatac atggataaca catttgattc 
651 tgtgtgmr ccataataaa actttaaaat aaaatgcaga cagtta 

which coiresponds to the cDNA sequence and derived amino acid sequence MIP-ip 
(macrophage inflammatory protein 1-3). See Lipes et al., Proc. Natl. Acad. ScL U.SA. 85, 
9704-9708 (1988). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 30) of: 

Met Gin He De Thi Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met Trp Pro Glu Asp 
Val Asp Ser Lys Ser Met Ghi Val Pro Phe Ser Arg Cys Cys Phe Ser Hie Ala Glu Ghi 
Glu ne Pro Leu Arg Ala lie Leu Cys Tyr Arg Asn Thr Ser Ser He Cys Ser Asn Glu 
Gly Leu lie Phe Lys Leu Lys Arg Gly Lys Glu Ala Cys Ala Leu Asp Thr Val Gly Trp 
Val Gin Arg His Arg Lys Met Leu Arg His Cys Pro Ser Lys Arg Lys 
encoded by the cDNA sequence (SEQ ID NO: 31) of: 

1 ACCAGGCTCA TCAAAGCTGC TCCAGGAAGG (XCAAGCCAG ACCAGAAGAC 
51 ATGCAGATCA TCACCACAGC CCTGGTGTGC TTGCTGCTAG CTGGGATGTG 
101 CJCCGGAAGAT GTGGACAGCA AGAGCATGCA GGTACCCTTC TCCAGATGTT 
151 GCITCTCATT TGCGGAGCAA GAGATTCCCC TGAGGGCAAT CCTGTGTTAC 
201 AGAAATACCA GCTCCATCTG CTCCAATGAG GCJCTTAATAT TCAAGCTGAA 
251 GAGAGGCAAA GAGGCCTGCG CCTTGGACAC AGTTGGATGG GTTCAGAGGC 
301 ACAGAAAAAT GCTGAGGCAC TGCCCGTCAA AAAGAAAATG AGCAGATTTC 
351 TTTCCATTGT GGGCJrCTGGA AACCACATGG CITCACCTGT CCCCGAAACT 
401 ACCAGCCCrrA CACCATTCCT TCTGCCCTGC TTTTGCTAGG TCACAGAGGA 
451 TCrGCITGGT CHTGATAAGC TATGTTGTTG CACTTTAAAC ATTTAAATTA 
501 TACAATCATC AACCCCCAAC 
which corresponds to the cDNA sequence and derived amino acid sequraice human secreted 
protein a-309). See MiUw et al., J. Immunol. 143, 2907-2916 (1989). 
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In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 32) of: 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu lie Ala Ala Thr Phe He Pro G]n 
Gly Lys Ala Gin Pro Asp Ala lie Asn Ala Pro Val Thr Cys Cys Tyr Asn Phe Thr Asn 
Arg Lys He Ser Val Gin Arg Leu Ala Ser Tyr Arg Axg lie Thr Ser Lys Cys Pro Lys 
Glu Ala Val lie Phe Lys Thr De Val Ala Lys Glu He i." Ala A; :*ro Lys Ghi Lys Trp 
Val Gin Asp Ser Met Asp His Leu Asp Lys Gin Thr Gin Thr Pro Lys Thr 
encoded by the cDNA sequence (SEQ ID NO: 33) of: 

1 CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC 
51 AGCATGAAAG TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC 
101 CTTCATTCCC CAAGGGCTCG CTCAGCCAGA TGCAATCAAT GCCCCAGTCA 
151 CXTGCTGTTA TAACTTCACC AATAGGAAGA TCTCAGTGCA GAGGCTCGCG 
201 AGCTATAGAA GAATCACCAG CAGCAAGTGT CCCAAAGAAG CTGTGATCTT 
251 CAAGACCATT GTGGCCAAGG AGATCTGTGC TGACCCCAAG CAGAAGTGGG 
301 TTCAGGATTC CATGGACCAC CTGGACAAGC AAACCCAAAC TCCGAAGACT 
351 TGAACACTCA CTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT 
401 AGCTTTCCCC AGACACCCTG TnTATTTTA TTATAATGAA TTTTGTTTGT 
451 TGATGTGAAA CATTATGCCT TAAGTAATGT TAATTCTTAT TTAAGTTATT 
501 GATGTTITAA GTTTATCTTT CATGGTACTA GTGnTTTTA GATACAGAGA 
551 CTTGGCKjAAA TTGCTTTTCC TCITGAACCA CAGTTCTACC OCTGGGATC 
601 TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT 
651 CCAATGCATT GCTAAAATAT TATTGTGGAA ATGAATATTT TGTAACTATT 
701 ACACCAAATA AATATATTTT TGTAC 
which corresponds to the cDNA sequence and derived amino acid sequence monocyte 
chemoattractant protein 1 (MCP-l). See Yoshimura et al.. FEBS Lett- 244, 487-493 (1989). 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 34) of: 

Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala Phe Ser Pro Gin 
Gly Leu Ala Gin Pro Val Gly He Asn Thr Ser Thr Thr Cys Cys Tyr Arg Phe He Asn 
Lys Lys lie Pro Lys Ghi Arg Leu Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro 
Arg Glu Ala Val De Phe Lys Thr Lys Leu Asp Lys Glu De Cys Ala Asp Pro Thr Gin 
Lys Trp Val Ghi Asp Phe Met Lys His Leu Asp Lys Lys Thr Ghi Thr Pro Lys Leu 
encoded by the cDNA sequence (SEQ ID NO: 35) of: 

1 AGCAGAGGGG CTGAGACCAA ACCAGAAACC TCCAATTCTC ATGTGGAAGC 
51 CCATGCCCTC ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG 
101 CTGCrCACAG CAGCTGCTTT CAGCCCCCAG GGGCTTGCTC AGCCAGTTGG 
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151 GATTAATACT TCAACTACCT GCTGCTACAG ATTTATCAAT AAGAAAATCC 
201 CTAAGCAGAG GCTGGAGAGC TACAGAAGGA CCACCAGTAG CCACTGTCCC 
251 CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 
301 CCCCACACAG AAGTGGGTCC AGGACTTTAT GAAGCACCTG GACAAGAAAA 
351 CCCAAACrCC AAAGCTTTGA ACATTCATGA CTGAACTAAA AACAAGCCAT 
401 GACTTTGAGAA ACAAATAATT TGTATACCCT GTCCITTCrC AGAGTGGITC 
451 TGAGATTATT TTAATCTAAT TCTAAGGAAT ATGAGCTTTA TGTAATAATG 
501 TGAATCATGG TTTTTCTTAG TAGATTTTAA AAGTTATTAA TATnTAATT 
551 TAATCTTCCA TGGATnTGG TGGGTTTTGA ACATAAAGCX: TTGGATGTAT 
601 ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTCCTCC CTTCTCTACC 
651 TCATGGGGGT ATTGTATAAG TCCTTGCAAG AATCAGTGCA AAGATTTGCT 
701 TTAATTG1TA AGATATGATG TCCCTATGGA AGCATATTGT TATTATATAA 
751 TTACATATTT GCATATGTAT GACTCCCAAA TTTTCACATA AAATAGATTT 
801 TTGTAAAAAA 
which conesponds to the cDNA sequence and derived amino acid sequence monocyte 
chemoattractant protein 3 (MCP-3). See: Stnictural and Functional Idaitificadon of Two . 
Human, Tumor-draived Monocyte Chcmotactic Proteins (MCP-2 and MCP-3) Belonging to tiie 
Chemokine Family. Jo Van Damme, Paul Proost, Jean-Pierre Lenaerts, and Ghislwi 
Opdenakker. J. Exp. Med. 176: 59-65, 1992. 

In another aspect, the present invention provides a heparanase having the amino acid 
sequence (SEQ ID NO: 36) of: 

Met Lys Val Ser Ala Ala Arg Leu Ala Val lie Leu lie Ala Hir Ala Leu Cys Ala Pro 
Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thi Thr Pro Cys Cys Phe Ala Tyr He Ala Arg Pro 
Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala 
Val Val Phe Val Thr Arg Lys Asn Arg Ghi Val Cys Ala Asn Pro Glu Lys Lys Tip Val 
Arg Glu Tyr lie Asn Ser Leu Glu Met Ser 
encoded by the cDNA sequence (SEQ ID NO: 37) of: 

1 CCTCCGACAG CCTCTCCACA GGTACCATGA AGGTCTCCGC GGCACGCCTC 
51 GCTGTCATCC TCATTGCTAC TGCCCTCTGC GCTCCTGCAT CTGCCTCCCC 
101 ATATTCCTCG GACACCACAC CCrGCTGCIT TGCCTACATT GCCCGCCCAC 
151 TGCCCCGTGC CCACATCAAG GAGTATTTCT ACACCAGTGG CAAGTGCTCC 
201 AACCCAGCAG TCGTCnTGT CACCCGAAAG AACCGCCAAG TGTGTGCCAA 
251 CCCAGAGAAG AAATGGGTTC GGGAGTACAT CAACTCTTTG GAGATGAGCT 
301 AGGATGGAGA GTCCTTGAAC CTGAACTTAC ACAAATTTGC CTGTTTCTGC 
351 TTGCTCTTGT CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC 
401 GCrCCTTGAA GGGCCCAGAT TCTGACCACG ACGAGCAGCA GTTACAAAAA 
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451 CCTTCCCCAC GCTGGACGTG GTGGCTCAGC CTTGTAATCC CAGCACTTTG 
501 GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG AGACAGCCTG 
551 GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 
601 GCGTGGTAGC GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 
5 651 AGAATGGCGT GAACCCGGGA GGGGAGCTTG CAGTGAGCCG AGATCGCGCC 
701 ACTGCACrCC AGCCTGGGCG ACAGAGCGAG ACTCCX3TCTC AAAAAAAAAA 
751 AAAAAAAAAA AAAAAATACA AAAATTAGCC GCGTGGTGGC CCACGCCTGT 
801 AATCCCAGCr ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA 
851 GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 
10 901 GACAAAGTGA GACrcCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT 
951 AAAGCCTAGA AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA 
1001 GGTTCTGAGC TCTGGCnTG CCTrGGCTTT GCAAGGGCTC TGTGACAAGG 
1051 AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AAGGGAGGAA CACTGCACTC 
1101 TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA 
15 1151 AAAATCGGGG 

which corresponds to the cDNA sequence and derived amino acid sequence Human T 
cell-specific protein (RANTES). See Schall et al.. J. Immunol. 141, 1018-1025 (1988). 

In another aspect, the present invoition provides a heparanase having the amino acid 
sequ^ce (SEQ ID NO: 38) of: 
20 Met Arg fle Ser Ala Thr Leu Leu Qrs Leu Leu Leu He Ala Ala Ala Phe Ser Be Ghi Val 

Trp Ala Gin Pro Asp Gly Pro Asn Ala Ser Hir Cys Cys Tyr Val Lys Lys Gin Lys lie 
Pro Lys Aig Asn Leu Lys Ser Tyr Arg Arg lie Thr Ser Ser Arg Cys Pro Trp Glu Ala 
Val lie Phe Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gin Lys Tip Val 
Glu Glu Ala He Ala Tyr Leu Asp Met Lys Thr Pro Thr Pro Lys Pro 
25 encoded by the cDNA sequence (SEQ ID Na 39) of: 

1 ACTGAAGCCA GCTCTCTCAC TCrCTTTCTC CACCATGAGG ATCTCTGCCA 
51 CGCTTCTGTG CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG 
101 GCCCAACCAG ATGGGCCCAA TGCATCCACA TGCTGCTATG TCAAGAAACA 
151 AAAGATCCCC AAGAGGAATC TCAAGAGCTA CAGAAGGATC ACCAGTAGTC 
30 201 GGTGTCCCTG GGAAGCTGTT ATCITCAAGA CAAAGAAGGG CATGGAAGTC 

251 TGTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 
301 CATGAAAACC CCAACTCCAA AGCCFTGAAG AAATGTGCCT GAACAGAAAC 
351 CAACCTAGGA GCCAAGAAGC AAAAATTCCT C\CCGCrGTT CTTTCTGAGA 
401 ACTGrrGATG AAATGTGTTG ATCACGGTCC TAAGGGATAG GAGCTGTCTG 
35 451 TAGGAATGTG AAACAGTCAC GCCTAAGGAA TGGTCnTAA GTTATTAATA 

501 TTTTTATTTA ATTAGCXIATG TACITTGGTG TGATTTGAAT GTAAAGCTCT 
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551 GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC 
which corresponds to the cDNA sequence and derived amino acid sequence human fic (growth 
factor-activated gene). See Heinrich et al., Molecular and Cellular Biology 13: 2020-2030, 
1993. 

5 In another aspect, the present invention provides a heparanase having the artiiao acid 

sequence (SEQ ID NO: 40) of. 

Asp Ser Val Ser He Phe De Thr Cys Cys Phe Asn Val lie Asn Arg Lys De Pro De Ghi 
Arg Leu Glu Ser Tyr Thr Arg De Thr Asn lie Gin Cys Pro Lys Glu Ala Val lie Phe Lys 
1\a Gly Lys Glu Val Cys Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His 

10 Lys Asp Gin lie Phe Gin Asn Leu Lys Pro 

which corresponds to the cDNA sequence and derived amino acid sequence monocyte 
chemoattractant protein 2 (MCP-2). See VanDamme et al., J. Exp. Med. 176: 59 - 65, 1992. 

The purified heparanase of the present invention, allows for the convenient selection of 
compounds having anti-hepaianase activity (AHA compounds), i.e. inhibitors of heparanase 

15 activity (IHA), by measuring inhibition of heparanase activity. Inhibition of h^aranase activity . 
can be measured utilizing in vivo radiolabeled heparan sulfate/heparin. This ligand is 
radiolabeled to high specific activity by intraperitoneal injection of 0.5mCi of S-35 sulfate into 
C57 mice bearing a 1-2 cm basement membrane tumor (EHS; Engelbreth, Holm, Swarm tumor). 
TTie tumor is harvested after 16 hours and the heparan sulfate proteoglycan extracted in 4 

20 volumes of 6M urea, 20mM Tris pH 6.8, protease inhibitors, 0.15M NaQ and 0.5% triton X- 

100. The urea extract is chromatographed on an anion exchange column and the proteoglycan is 
eluted in a linear gradient of NaCI. The radiolabeled proteoglycan is exchanged into a solution 
of 4.0M guanidine-HCl, 20mM Tris pH 7.4 and applied to a size exclusion column. The 
proteoglycan peak is pooled and exchanged mto 0.1 5mM NaCI and 20mM Tris pH7.4. 

25 Purified, radiolabeled proteoglycan is coupled to commercially available agarose siq>port 

A quantitative assay of heparanase activity is constructed with the radiolabeled ligand in a multi- 
well format Briefly, known quantities of recombinant heparanase are added to a multi-well 
plate containing equal amounts of radiolabeled ligand in each well. Enzyme-ligand interaction 
proceeds overnight and the ligand-agarose conq)lex is recovered by centrifugation. Radioactivity 

30 in the liquid phase is determined by scintillation counting and is the measure of eaizyme activity. 
Potential enzyme inhibitors can be evaluated by adding the compound to the solution phase or 
alternatively adding the assay components to multi-well plates containing preweighed amounts 
of test con^oimd. 

In addition, the purified heparanase of the subject inv^tion can be used for therapeutic 
35 wound healing or can be immobilized onto filters and used to degrade heparin firom the blood of 
patients post-surgery. 
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Wound treatment can be achieved by administration to an afflicted individual an 
effective amount of a pharmaceutical composition comprising the purified heparanase in 
combination with a pharmaceutically acceptable, preferably slow releasing, carrier. See. e.g. 
PCT/US90/04772, incorporated herein by reference. 
5 Immobilization onto filters can be achieved by the methods well known in the art 

including those disclosed by Langer et al. in Biomaterials: Inter-facial Phenomenon and 
Applications, eds. Cooper et al, pp 493-509, 1982 and those described in U.S. Patent No. 
4,373,023, 4,863,611 and 5,211,850 (all incorporated herein by reference). 

Hie purified b^aianase of the subject invention can be prepared by the method 
10 described in procedure A or procedure B. but preferably procedure A. 

PROCEDURE A 

Reverse transcription of the mRNA fitom activated human leukocyte-derived cells 
(preferably lytnphocytes, neutrophils, platelets, Juricatt lymphoma cells, Dami cells (Greenberg et 
al.. Blood 72:1968-1977, (1988)] is used to pr^are the cDNA for the desired heparanase 

15 enzyme (preferably SEQ. ED. NO: 1; optionally SEQ. ID. NO: 3, SEQ. ID. NO: 5, SEQ. ID. 
NO: 7: SEQ. ID. NO: 13, SEQ. ID. NO: 15, SEQ. ID. NO: 17. SEQ. ID. NO: 19, SEQ. ID. 
NO: 21. SEQ. ID. NO: 23, SEQ. ID. NO: 25, SEQ. ID. NO: 27. SEQ. ID. Na 29; SEQ. ID. 
NO: 31. SEQ. ID. NO: 33 or SEQ. ID. NO: 35), employing standard PCR cloning techniques 
(described in Sambrook et al., in: Molecular Cloning, A Laboratory Manual, Second Edition, 

20 1989. Cold Spring Harbor Press). The cDNA encoding the h^iaranase en^me is cloned into 
Xbal/BamHl sites in the conmaercially available baculovirus vector pVL 1392 (Phamungrai; 
San Diego, CA). High titer infectious virus is selected for use in infecting sf9 insect cells 
(Luckow and Simamers, Bio/Technology. 6,47 1988). Saiim-free medium conditioned by 
infected sf9 cells is collected after 72 hours. This media is the starting material for purification 

25 of recombinant heparanase. Serum-free conditioned media is adjusted to contain 20mM Sodium 
Acetate, pH 5.0, 0.15M NaCl, loiM reduced glutathione (GSH), IraM dithiothreitol (DTT) mi 
lOmM beta-octylglucoside. Medium is applied to a column of cation-exchange resin 
(Pharmacia) and eluted from the colimm in a linear gradient of NaQ. Fractions cont^ning 
heparanase are pooled and diluted to a final salt concentration of 0.1 5M NaCl. To this solution 

30 is added 20inM Tris and the pH adjusted to 7.0. The solution is appiied to a column of 

heparin-Sepharose (Pharmacia) and eluted with a linear salt gradient buffered to pH 5.0 with 
20mM Soditrai Acetate. Heparanase is concentrated to 0.5mg/ml in an Amicon concentrator 
fitted with a YM-2 membrane and stored at -80 degrees. For optimal activity (greater than 50 
units heparanase activity per ug protein) incubation in the presence of transglutaminase, under 

35 reducing conditions, in accordance with the procedure in Example 2, Part C. 
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PROCEDUREB 

This procedure describes the purification to homogeneity of heparanase (SEQ. ID. NO: 
1) from human blood cells or cell lines (such as platelets) under reducing conditions which 
allow for the occurrence of post-translational modifications that increase the specific activity of 
5 heparanase and make it suitable for use in the above describ«i screening assay. The cells are 
treated with a suitable activator (such as, but not limited to. thrombin or histamine) which 
allows for the release of eruymes and cytokines from the cell. Reducing agents are added to the 
supernatant from the activated cells. Suitable reducing agents include dithiothreitol (DTT), 
dithioerythritol (DTE), reduced glutathione (GSH), and P-mercaptoethanol. The reduced, 

10 activated supernatant is chromatographed on a column of immobilized heparin or heparan sulfate 
under reducing conditions at pH 5, using a salt gradient (such as NaQ, KCl, or oth^ salt) to 
elute the bound proteins. Fractions containing heparanase activity are pooled and exchanged 
into any buffer appropriate for the pH of 6.8 and containing 0.15 M NaCl, reducing agents, and 
non-ionic detergent This is passed over any suitable anion-exchange column (bed volume of 5 

15 ml or less). The unbound material from this colunm is adjusted to pH 5 with acid, and is 

loaded onto any suitable cation-exchange column (bed volume of 5 ml or less), equilibrated in a 
suitable pH 5 buffer containing 0.15 M NaCl, reducing agents, and non-icmic detergents. The 
bound protein is eluted from the column with a salt gradient, and the fractions containing 
heparanase activity are pooled and size fractionated to below 30,000 daltons with 30 K-cut-off 

20 membranes. The protein below 30,000 daltons is concentrated by either beparin-sepharose 
chromatography or by centrifiigation through 5 K-cut-off membranes. 

The present invention is seen more fidly by the examples set forth below. 
Example 1: Use of Heparanase as a screen for AHA compounds. 

1. Heparan sulfate, metabolically labeled (S-35) to a high-specific activity- as described 
25 above for the EHS tumor, prepared by papain digestion of chromatographically purified heparan 

sulfate proteoglycan is coupled to cyanogen bromide activated Sepharose-6B (Pharmacia) 
according to manufacturer's instructions. 

2. ^^S-Heparan suIfate-Sepharose 6B is resuspended in: 0.15 M NaQ, 0.03% human 
serum albumin, 10 pM MgQj' 1^ PM CaQj. antiproteolytic agents (1 pg/ml leupeptin, 2 pg/ml 

30 antipain, 10 pg/ml benzamidine, 10 units/ml aprotinin, 1 pg/ml chymostatin, and 1 pg/ml 
pepstatin), and 0.05 M Na acetate, pH 5.6 and 5,000 cpm, in a total volume of 200 pi, are 
aliquoted into each well of a 96 well plate. To each well is added 5 imits of activated 
heparanase and the digestion allowed to proceed overnight at 37 degrees. 

3. Separation of digested product is accomplished by centrifugation of the 96 well 
35 plate. The supernatant, containing cleaved heparan sulfate is decanted and quantitated by 

scintillation counting. 
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4. Inhibitors of hepaianase activity can be introduced into the Uquid-phase of the 



5. A potential inhibitor of heparanase activity would be identified by its abiUty to 
reduce the amount of radiolabeled heparan sulfate released into the supernatant by 50% at a 
5 concentration of 1 pM or less. 

Exami>le2: The preparation of hepaianase under reducing conditions as outUned in 

Procedure B. 

Part A: 

Platelet-rich plasma (10^ platelets/ml; 1800 nil) is obtained fitom healthy, itiformed 
10 volunteers by plasmapheresis. The plasma is removed from the platelets by centrifugation 

Ql^l^,etal., Exp. Cell Res, 109: 429-437.1977). Platelets suspended in phosphate buffered 
saline (PBS; 0.1 original volume) are then stimulated with I U/ml thrombin for 5 mm at 37-C. 
•nns concentration of thrombin was reported to release 100% of the heparanase activity from 
platelets (Oldberg, et aU Biochemistry 19: 5755-5762, 1980). FoUowing activation, the 
15 thromlrin is inactivated by the addition of 100 mM phenyhnethylsulfonylfluoride (PMSF), and 
the platelets are centrift.ged at 2000 xg for 30 min at A-C. TTie s,q»ematant is stored at -80°C 
until used for the chromatographic purification of heparanase (Part B). 

PartB: Chromatogr^hic purification of heparanase. 
1. Heparin-Sepharose Chromatography. Activated platelet supematants are pooled and 
20 adjusted to contamlmMGSH and ImMDTT. This pool is loaded (2.5 ml/inin) onto a 

colmnn of heparin-sepharose (2.6 x 7.5 cm, 40 ml) equilibrated in 1 mM GSH, 1 mM DTT. 150 
mM Naa, 10 mM NaPO^, pH 7.4. After loadmg the sample, die column is washed with 200 
ml of 0.15 M Naa. 1 mM GSH. 1 mM DTT. 10 mM Na acetate, pH 5. foUowed by 60 ml of 
0.35 M Naa. 1 mM DTT, 1 mM GSH, 10 mM Na acetate. pH 5. The column is then eluted 
25 with a 160 ml linear gradient between 0.35 M Naa and 1.5 M Naa in the same buffer. 

AUquots of each fraction are used for determination of heparanase activity by the "Purification 
Assay" described lata". 

2. Anion-exchange chromatography (For example, DEAE-Sephacel, Pharmacia). The 0.9 
M - 1.15 M Naa fractions from the heparin-sepharose column are concentrated using a stirred 

30 cell fitted with a PM-10 membrane, and the buffer is exchanged to 0.15 M Naa 1 mM DTT. 1 
mM GSH. 10 mM Pnoctylglucoside. 10 mM sodium phosphate. pH 6.8 (8 ml). This sample is 
loaded onto a 5 ml column of DEAE-Sephacel equiUbrated in the same buffer. After loading, 
the column is washed to baseline absorbance (280 nm) with the eqmlibration buffer. The flow- 
through and wash with equiUbration buffer axe coUected as one pool. Hie column is then eluted 

35 with 10 ml of 0.15 M Naa. 10 mM ^octylglucoside. 1 mM GSH, 1 mM DTT, 10 mM Na 
acetate. pH 5. followed by 10 ml of 1.5 M NaO, 10 mM P-octylglucoside. 1 mM GSH. 1 mM 
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DTT, 10 mM Na acetate, pH 5. Aliquots of each pool are used for determination of heparanase 
activity by the 'Purification Assay". 

3. Cation Exchange. The unbound sample from the DEAE-Sephacel column is adjusted to 
pH 5 with glacial acetic acid and loaded onto a cation exchange column (Poros HS/F, 4,6 mm x 

5 50 mm; PerSeptive Biosystems), pre-equilibrated with 0.15 M NaQ, 1 mM DTT, 1 mM GSH, 
10 mM ^-octylglucoside, 10 mM Na acetate, pH 5. TTie flow rate is 3.0 ml/min and 3 ml 
fractions are collected. After washing the colunm with 35 ml of equilibration buffer, the colunm 
is developed with a 55 ml linear gradient between 0.15 M and 1.5 M NaCl in the same buffer. 
10 pi aliquots of the fractions are used for determination of heparanase activity by the 
10 "Purification Assay". 

4. Size fr^tionation to < 30 kD and concentration on immobilized heparin (Hi-trap 
heparin-sepharose, Pharmacia). The activity from the Poros HS/F column is size fractionated by 
centrifiiging through 30,000 molecular weight cut-off filtras (NOllipore ultrafiree-MC 30,000 
NMWL filter units). The < 30 kD pool is diluted to contain 0.15 M NaCl, and is loaded onto a 

15 1 ml Hi-tr^ heparin column, pre-equilibrated with 0.15 M NaCl. 1 mM DTT, 1 mM GSH, 10 
mM Na acetate, pH 5. The column is eluted with 1.2 M NaCl in the same buffer and the single 
eluted peak contains the heparanase activity. 

Part C: Properties of the purified heparanase. 

The final yield of h^araneise protein from 1850 ml platelet-rich plasma was 2.7 mg. 

20 Protein concentration was determined by the method of Lowry (J. Biol. Chem. 193: 265-275, 
1951), or if more precise detexminations were required, by amino acid analysis on an amino acid 
analyzer (Beckman 6300). The overall recovery of activity was 8%, with a 4150-fold 
purification. The preparation was judged to be homogeneous by the presence of a single band 
of 9000 daltons on an 18% silver-stained SDS-polyacrylamide gel, run according to the method 

25 of Laemmli {Nature 227: 680-685, 1970). 

The pH optimum of the pxuified heparanase was determined by conducting the 
'T*urification assay" activity between pH 3.5 and 8.0, using a citrate buffer (pH 33 - 6.0), 
citrate-phosphate buffer (pH 6.5 - 7.0). and phosphate buffer (pH 7.5 - 8.). Heparanase was 
active between pH 5.0 and 8.0, with the optimiun pH at 5.8. 

30 N-tenninal amino acid sequencing of heparanase produced by this procedure was 

performed using a gas/liquid phase Protein Sequencer (Applied Biosystems Inc. Model 470). 
Phenylthiodantoin amino acids were resolved and quantitated by an on-line HPLC system 
(Model 120, Applied Biosystems Inc.) with data analysis on a Nelson Analytical System. N- 
terminal amino acid sequences of the heparanase produced in this example were 85 % 

35 SEQ. ID. NO: 9 (namely: 
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Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 
15 10 15 

Glu Leu Arg). 

which is identical to CTAP-m. and 15% SEQ. ID. NO: 10 (namely: 

5 Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu). 
1 5 10 15 

which is the precursor form, platelet basic protein. Interestingly, the N-terminal sequence of 
comm«cial P-thromboglobulin (namely, Calbiochem (Cat # 605165), Celsus Laboratories (Cat 
# 41705). and Haematologic Technologies (Cat. # HBTG-0210), which has low levels of 
10 heparanase activity, was 100% SEQ. ID. NO: 11 (namely: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu), 

1 .mxiS 10 .15 

indicating thal vfte commwcial preparatio;. is actually CTAP-III and not P-thromboglobuIin. 

Chromatofocusing of the heparanase produced by this procedure results in two peaks of 
15 differing isoelectric points. To perform the chromatofocusing, hspaianase is dissolved in 0.025 
M imidazole. pH 13. The sample is loaded onto a 0.5 x 20 cm column of Polybuffer 
Exchanger 94 (Pharmacia), equilibrated wth 0.025 M imidazole, pH 73. Immediately after 
sample loading, Polybuffer 74 (Pharmacia; 1:8, pH 4) is pumped onto the column at 03 ml/min. 

2 ml fractions are collected, and the pH of each fraction is determined by a narrow range (pH 4- 
20 7) pH paper. Aliquots of each fraction are used to determine heparanase activity by the 

"Purification Assay." AH of the activity is associated with an absorbance (280 nm) peak that 
elated at pH 4.8 to 5.1, representing s^projdmately 10% of total protein, while 90% of the 
protein is eluted at pH 7.3 and is inactive. Aliquots of each protein peak are separated from the 
ampholytes by C4 reverse phase chromatography. The peak that eluted from the chromato- 

25 focusing column at pH 7.3 has JV-teiminal sequences for platelet basic protein and the processed 
form, CTAP-in. The peak that is eluted frx)m the chromatofocusing column at pH 4.8 - 5.1 also 
contains the sequences of platelet basic protein and the processed form, CTAP-m. All of the 
platelet basic protein processed forms have pi's that are calculated and rq;>orted to be greatex 
than 7.6. Thus, the heparanase activity resides in the platelet basic protein and/or the processed 

30 form, CTAP-ffi that is modified such that the pi is lowered to 4.8 - 5.1. 

Heparanase obtained after cliromatofocusing exhibited a specific activity of 80 unitr ug 
protein, using the "Purification Assay." This represraits a 1000-fold increase in the specific 
activity compared to the commercial protein (p-thromboglobulin; 0.075 units/pg protein). 
The modification that may be responsible for the lower isoelectric point of active 

35 heparanase is ADP-ribosylation. ADP-ribosylation (Adenine diphosphate-ribosylation) is a post- 
translational modification of protems or DNA in which the ADP-ribose group of NAD 
(Nicotinamide adenme dinucleotide) is enzymaiically transferred to proteins or DNA. Since this 
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modification adds two negatively charged phosphate groups to a molecule, it would result in a 
lower isoelectric point. Activated platelet supematants were incubated in 1 mM DTT, 2 mM 
MgQ^, 100 mM HEPES, pH 7.4, and 0.5 pM ['^P]NAD (Specific activity = 1000 Ci/mmol). 
The labeled proteins were separated by SDS-polyacrylamide gel electrophoresis on an 18% gel, 
5 transferred to PVDF (polyvmylidene difluoride) membrane, and exposed to X-ray fibn. The 
autoradiogram demonstrates the incorporation of (^^PJ into a protein of 8000 daltons. TTie 
PVDF membrane was immunoblotted with the anti-Peptide C antisera (1:1500 in PBS 
contaming 5% dry milk, 0.05% Tween-20, 0.15 M NaQ, 20 mM Tris, pH 7.4, 2 hours at room 
tempeiatare, followed by incubation with peroxidase-Iabeled goat anti-chidcen IgG (1:500 in 

10 above buffer, 1 hour room temperature), and reacted with a peroxidase substrate. The 

immunoblot revealed that the 8000 dalton that was labeled with [^^P] was CTAP-m/heparanase. 
The addition of 200 pM sodium nitroprusside, a spontaneous releasCT of nitric oxide, to the 
ADP-ribosylation reaction resulted in 5-fold more incorporation of ['^P] label into CTAP- 
ni/h^aianase, suggesting that this modification can be regulated in vivo by nitric oxide. 

15 Finally, in an analgous manner to that of gIyceraId^yde-3-phosphate dehydrogenase, another 
platelet ADP-ribosylated glycolytic enzyme (Zhang and Snydrar, Proc. Natl. Acad. Sci. USA 89: 
9382-9385), it was determined that CTAP-lII/heparanase has an auto-ADP-ribosylation activity, 
since the [^^P]-ADP-ribosylation of CTAP-in/heparanase occurs in reactions where the only 
protein present is commercial CTAP-m or purified heparanase. Other chCTtioldne family 

20 membos tested, which mcludes IP-1-. IL-8, gro-a, and MCAF, also have auto-ADP-ribosylation 
activity. 

It IS contemplated that the high specific activity of CTAP-III/hiq)aranase is a 
consequence of ADP-ribosylation of the enzyme in the presence of nitric oxide. It is further 
contemplated that the action of transglutaminase on the ADP-ribosylated enzyme will lead to 

25 further increase in the specific activity. 

An amino acid composition of die heparanase produced in Example 2 gave the expected 
amino acid composition for CTAP-m and N-t^minal sequracing revealed sequences for platelet 
basic protein and the processed form, CTAP-DI , confirming that the heparanase activity is 
contained in this set of processed proteins and is not due to a minor contaminant. The presence 

30 of heparanase activity in three commercial sources of ^-thromboglobulin also confirms this 
conclusion. In addition, polyclonal antibodies to P-tfaromboglobulin were found to precipitate 
30 - 70% of the heparanase activity in three separate experiments, providing additional 
confirmation. 

The activation of heparanase with transglutaminase (prepared in accordance with 
35 Example 2, Part B) results in a substantial (about 13-fold) increase in the specific activity of the 
enzyme. The heparanase (2 ul at 56 nM) obtained by Example 2, Part B is treated with either 



PCT/US94/08207 

WO 95/04158 



transglutaminase from guinea pig liver (4 mU; Sigma) or with Factor XTH (1 pg; Celsus 
Laboratories, Inc.), the blood coagulation factor that is activated by treatment with 5 units of 
thrombin at 37 degrees for 30 minutes. Heparanase is activated by incubation of either 2mU 
Uver transglutaminase or 5 units of activated Factor Xm in the presence of O.IM NaAcetate 
5 buffer at pH 6.0 contaming ImM reduced glutathione and ImM CaQ for 35 minutes at 37 
degrees. Treatment of heparanase with either type of transglutaminase results in a substantial 
increase in the specific activity of the h^aranase. 

•nie high degree of sequence identity between CTAP-in ana lnteileukin-8, a CXC 
chemokine family member, assures that an essentially identical foldinr pattern will be shared by 
10 the two proteins. Since the 3-dimensional structure of Interleukin-8 is known (Qoie, er a/.. 
Biochemistry 29: 1689-1696.1990; Baldwin, ef aZ., J. Biof. CW 265: 6851-6853), one can 
model the same for CTAP-m. Such a model can serve to direct i.- .arch into rationally 
designed IHA and to help explain the action of transglutaminase in activating the CTAP-m. 
Part D: Purification Assav for H eparanase Activity 
15 Heparanase activity from platelets or column fractions is detected by its ability to digest 

the > 70 kD 35S-HSPG to produce lower molecular weight products. Each digest contains 10 pi 
sample. ^^s-HSPG (2000 cpm). 0.15 M NaQ, 0.03% human serum albumin, 10 pM MgClj' 10 
pM CaCLj, antiproteolytic agents (1 pg/ml leupeptin. 2 pg/ml antipain, 10 pg/ml benzamidine, 
10 units/ml aprotinm. 1 pg/ml chymostatin, and 1 pg/ml pepstatin). and 0.05 M Na acetate. pH 
20 5.6 in a total volume of 300 pi. Digests are carried out for 3 to 21 h. The presence of lower 
molecular weight radiolabeled products is detected by centrifiigation tiirough 30,000 MW-cutoff 
filters. The digests containing 2000 cpm of ^^S-HSPG (> 70 K) axe centrifuged through 30.000 
molecular weight cut-off filters (MilUpore Ultrafree-MC 30.000 NMWL filter miits). ^Ss-HSPG 
degradation is evident by the presence of radioactivity in ibc filtrate tiiat passed through die 30 
25 K membrane; tiiis heparanase activity is expressed as the % of total cpm < 30 K for a given 
digest. Analysis of heparan sulfate degradation by tills method is quick and reproducible. 1 
unit of heparanase activity is defined as 1% cpm < 30 K per h. For pH optimum determmation, 
the 0.1 M Na acetate buffer is replaced by 50 mM citrate, citrate-phosphate, or phosphate buffer 
at varying pH's. For samples from chromatographic steps performed under reducing conditions 
30 (1 mM GSH. 1 mM DTD, the concentration of a tiiiol oxidant (diamide) needed for optimum 
activity is determined. This concentration (100 pM diamide) is added to all assay tubes when 
reduced samples are assayed. 

Pre paration of ^Ss-HSPO (>70 K) for u ^e-. in die "Purification Assay." 

35S-HSPG (>70 K) is prepared from mice bearing a basement membran tumor tiiat 
35 overproduces HSPG (EHS tumor), using modifications of die metiiod of Ledbetter. eL al.. 1987. 
Briefly, tiie radiolabeled HSPG was prepared by injecting C57BL mice bearing die EHS tumor 
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(Oridn, et-al., 1977) with scxiium [-^^Slsulfate (0.5 mCi/mouse) 18 h before harvesting the tumor. 
The HSK5 is extracted from the weighed tumor with 6 voliunes (w/v) of Buffer A (3.4 M NaQ, 
0.1 M 6-aminohexanoic acid, 0.04 M EDTA, 0.008 M AT-ethyhnaleimide, 0.002 M PMSF, and 
0.05 M Tris-HQ, pH 6.8), by homogenization with a Polytron for 30 s, followed by stiiiiiig at 

5 4*'C for 1 h. Insoluble material is collected by centrifugation (12,000 x g for 10 min), and the 
stipematant is discarded. The insoluble residue is reextracted with 2 volumes (original tumor 
weight) of Buffer A for 30 min with stirring at 4'*C Insoluble material is again collected by 
craitrifugation, and the supernatant fraction is discarded. The insoluble matoial is then 
suspended in 6 volumes of Buffer B (6 M urea, 0.1 M 6-aminohexanoic acid, 0.04 M 

10 ethylenediaminetetraacetic acid (EDTA). 0.002 M PMSF, and 0.05 M Tris-HQ. pH 6.8), 

homogenized with an electric homogenizer (Polytron) for 30 s, and stirred for 2 h at 4°C. The 
mixture is centrifiiged to remove insoluble material, and the supernatant is retained. The 
insoluble material is reextracted with 2 volumes of Buffer B. The mixture is centrifuged, and 
the supernatant is combined with the previous siqiematanL 

15 •'^S-HSPG is isolated from the Buffer B supernatant by sequential chromatography on 

anion exchange and gel filtration columns. The Buffo: B supernatant is dialyzed overnight 
against 10 volumes of 6 M urea, 0.15 M NaQ, 0.05 M Tris-Ha, pH 6.8, and is adjusted to 
contain 0.5% non-ionic detergent (Triton X-100). This supernatant (from 11 g tumor) is 
chromatogn^hed on a 30 ml colunin of anion exchange resin (DEAE-Sephacel) equilibrated 

20 with 6 M nrcz, 0.15 M NaQ, 0.05% Triton X-100, 0.05 M Tris-HCS, pH 6.8. After loading the 
supernatant and washing with the eqxiilibration buffer, the column is developed with a 250 ml 
linear gradient between 0.15 M NaQ and 1.15 M NaQ (Flow «= 2.0 ml/min). Rractions are 
sampled for radioactivity, and those containing the ^^804 label that elutes from the DEAE- 
Sephacel between 0.4 M and 0.8 M NaQ are pooled. The proteoglycan is precipitated by the 

25 addition of 4 volumes of 100% EtOH at -20'*C overnight The precipitate is collected by 

centrifugation and is solubilized in 1 ml of Buffer C (4 M Gu-HQ, 20 mM Tris-HQ, pH 7.2). 
This solubilized pellet is used for chromatography on a calibrated gel filtration colunm (1.0 x 50 
cm colunm of Superose 6; Pharmacia) equilibrated in Buffer C (Flow <= 0.5 ml/min). Fractioos 
are sampled for radioactivity, and those containing the '^804 label that elutes with a molecular 

30 weight > 70 kD were pooled. The proteoglycan is precipitated with 100% EtOH as described 
above. The pellet is dissolved in 3 ml PBS, and dialyzed against 3 x 100 volimies of PBS. 
Each preparation of ^^S-HSPG is confirmed to be > 98% heparan sulfate by susceptibility to 
low pH nitrous acid degradation (Shiveley and Conrad, Biochemistry 15: 3932-3942, 1976). 
Example 3: Preparation of cDNA encoding Heparanase. 

35 Media is removed from cultured HEL{HEL 92.1.7; Human erythroIeukemia;ATCX: No. 

TIB 180) cells stimulated with lOnM phorbol 12-myristate 13-acetate (Sigma Chemical Co., St 
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Louis, MO) and the cells scraped from the dish and pelleted by centrifugation. The pellet is 
extracted with 200m1 of TRI reageait (Molecular Research Center Inc. Cincinnati, OH) and the 
total cellular RNA is prepared according to the manufacturer's instructions. To prepare first 
strand synthesis the revrase transcriptase reaction was performed with lOul of total cellular RNA 
5 in the presence of 4«1 of 5x transcriptase buffer (Bethesda Research Laboratories, Gaithersbarg, 
MD), lul 0.2mM DTT, 4ul random hexanucleotides (Amersham Corp. Arlington Heights, ILL), 
and 1h1 lOmM dNTP (BRL). This solution is heated to 95 degrees C for 5 minutes and then 
placed on ice. To this is added ImI RNAsin and I«l reverse transcriptase (M-MLV), (Promega, 
Madison WI). This is incubated at 37 degrees for 60 minutes and then placed on ice. The 

10 polymerase chain reaction is carried out as follows. To 3ul of the first strand (above) is added 
lul of each Primer (see below), TIul of water IOmI lOx PGR buffer (Petkin Elmer Cetus, 
NorwaUc CT) and 2«1 each dNTP. This solution is heat daiatured at 95 degrees C and 1«1 
An^)litaq DNA polymraase (Pwidn Bmer Cetus) is added. Hybridization temperature begins at 
72 degrees and is lowered by one degree pex cycle until reaching 55 degrees. Each 

15 hybridization step is followed with a constant elongation temperature of 72 degrees. Upon 
conqiletioQ the solution is left at 0 degrees until storage at -20 degrees. The products of the 
PGR reaction are electrophoresed on 3% NuSieve, 1% agarose gels and bands of expected size 
are excised and purified by standard procedures. 
Primers: 

20 Platelet Basic Protein: TGG ACT AGT ATG TCC TCC ACC AAA GGA CAA ACT AA 
CTAP DI: TGG ACT AGT ATG AAC TTG GCG AAA GAG GA 
B-thrombglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG 
NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA 
Example 4: Preparation of cDNA raicoding Heparanase. 

25 Media is removed from cultured leukocyte-d^ved cells [e.g., lymphocytes, neutrophils, 

platelets, Jurkatt lynq)homa cells, Dami cells (Greenberg et aL. Blood 72:1968-1977, (1988)], 
stimulated with Concanavalin A or phoibol 12-myristate 13 acetate (Sigma Chemical Co., SL 
Louis, MO) and the cells scraped from the dish and pelleted by centrifugation. The pellet is 
extracted with 200ul of TRI reagent (Molecular Research Center Inc. Cmcinnati, OH) and the 

30 total cellular RNA is pr^ared according to the manufacturer's instructions. To prepare first 

strand synthesis lb& reverse transcriptase reaction was performed with IOmI of total cellular RNA 
in the presence of 4«1 of 5x transcriptase buffer (Bethesda Research Laboratories, Gaithersburg, 
MD), 1m1 0.2mM DTT, 4ul random hexanucleotides (Amersham Corp. Arlington Heights, ILL), 
and lul lOmM dNTP (BRL). This solution is heated to 95 degrees C f or 5 minutes and then 

35 placed on ice. To this is added lul RNAsin and lul reverse transcriptase (M-MLV), (Promega, 
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Madison WI). This is incubated at 37 degrees for 60 minutes and then placed on ice. TTie 
polymerase chain reaction is carried out as follows. To 3u\ of the first strand (above) is added 
1«I of each Primer (see below), 77«1 of water 10«1 lOx PGR buffer (Peridn Elmer Cetus, 
Norwalk CT) and 2ul each dNTP. This solution is heat denatured at 95 degrees C and lul 
5 Amplitaq DNA polymerase (Peridn Elmer Cetus) is added. Hybridization temperature begins at 
72 degrees and is lowered by one degree per cycle until reaching 55 degrees. Each 
hybridization step is followed with a constant elongation temperature of 72 degrees. Upon 
completion the solution is left at 0 degrees until storage at -20 degrees. The products of the 
PCR reaction are electrophoresed on 3% NuSieve, 1% agarose gels and bands of expected size 
10 are excised and purified by standard procedures. 
Primers: 

Platelet Basic Protein: TGG ACT ACT ATG TCC TCC ACC AAA GGA CAA ACT AA 

CTAP m: TGG ACT AGT ATG AAC TTG GCG AAA GAG GA 

B-thrombglobulin: TGG ACT AGT ATG GGC AAA GAG GAA AGT CTA GAC AG 

15 NAP-2: TGG ACT AGT ATG GAA CTC CGC TGC ATG TGT ATA AA 

All temperatures expressed throughout the subject specification are in degrees Cendgrade. 

The cDNA encoding h^aranase is preferably cloned into a vector designed for 
ejq)ression in eukaiyotic cells, rather than into a vector designed for e}q>ression in prokaryotic 
cells (e.g. E. coli). Eukaryotic cells are prefOTed for expression of genes obtained from higher 

20 eukaryotes because the signals for synthesis, processing, and secretion of these proteins are 
usually recognized, whereas tins is often not true for prokaryotic hosts (Ausubel, er oi., ed., in 
Short Protocols in Molecular Biology, 2nd edidon, John Wiley & Sons, publishers, pg.16-49, 
1992.). Eukaryotic hosts may include, but are not limited to, the following: insect cells, 
Afiican green monkey kidney cells (COS cells), Chinese hamster ovary cells (CHO cells), and 

25 Murine 3T3 fibroblasts. 

Experiments demonstrating that a synthetic peptide of CTAP-III/NAP-2 or antisera 
raised against a synthetic peptide of CTAP-in/NAP-2 inhibit the heparanase activity of CTAP- 
III/NAP-2 suggest that the amino acids participating in enzymatic catalysis are contained in a C- 
terminal region of the enzyme. 

30 Peptide Synthesis; A C-terminal peptide contained within the sequences known for CTAP-IH 
(SEQ ID NO: 1), Platelet Basic Protein (SEQ ID NO: 3). p-thromboglobulin (SEQ ED NO: 5), 
and NAP-2 (SEQ ID NO: 7), and an N-tenninal p^tide contained within the sequences known 
for CTAP-m (SEQ ID NO: 1) and Platelet Basic Protein (SEQ ID NO: 3) were synthesized 
accorcUng to standard procedures. The N-terminal peptide has the following sequence (SEQ ID 

35 NO: 41: Asn Leu Ala Lys Gly Lys GIu Glu Set Leu Asp Ser Asp Leu Cys, in which the final 
Cys residue was added to regions of known sequence (SEQ ID NOS: 1,3) for the purpose of 
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conjugation to a carrier protein. The C-terminal peptide has the following sequence (SEQ ID 
NO: 42): Cys Asn Ghi Val Glu Val He Ala Thr l^u Lys Asp Gly Arg Lys He Cys l^u Asp 
Pro Asp Ala Pro Arg De Lys Lys De Val Gha Lys Lys encoded by the cDNA sequence (SEQ 
ID NO: 43) of TGCAACCAAG TCGAAOTGAT AGCCACACTG AAGGATGGGA 
5 GGAAAATCrcCCTGGACCCAGATCCTCCCAGAATCAAGAAAATrGTACAG 

AAAAAA. These p^tides (SEQ ID NOS: 41 and 42) were produced by stepwise soUd phase 
peptide synthesis on an Applied Biosystems 430A Peptide Synthesizer. The 9- 
fluoroenylmethyloxycarbonyl (Fmoc) group was used as the N« amino protecting group, and 
temporary side-chain protectin groups were as follows: Arg (Pmc), Asn (Trt), Asp (OtBu). Gto 
10 Crrt) Glu (OtBu), His CTit), Lys (Boc). Ser (tBu). Thr (tBu). Each residue was single coupled 
using a HBTU/NMP protocol and capped with acetic anhydride before the next synthesis cycle. 
After removal of the N-termmal Fmoc group, temporary side-chain protecting groups were 
removed and the peptide cleaved from the resin by treatment with 95% TFA/5% scavengers 
(ediyl methyl sulfide/anis. ; /U-ethanedithiol. 1:3:1) for two hours at room temperature. The 
15 crude peptides were precipitated from die cleavage solution with cold diethyl ether. Tlie 
ptecipiuted peptide was coUected on a sintered glass funnel, washed with diethyl ether, 
dissolved in dilute acetic acid, evaporated to dryness under reduced pressure, and the lesidue 
was redissolved and lyophillized from glacial acetic acid. The crude peptides were purified by 
preparative reverse phase chromatography on a Phenomenex C-18 column (223 x 250 mm) 
20 using a water/acetonitrile gradient, each phase contaming 0.1% trifluoracetic acid (TFA). Qean 
fractions, as determmed by analytical HPLC, were pooled, the acetonitrile was evaporated under 
reduced pressure, and tiie aqueous solution was lyophillized. The ourified peptides were 
characterized by time of fUght or FAB mass spectroscopy. 

Further. SEQ ID NO: 42 can be produced by recombinant DNA methodology as stated 
25 in Procedure A (page 21). 

Anti^ra Production: TTie synthetic peptides of CTAP-III/NAP-2 were conjugated to keyhole 
lunpet hemocyanin utilizing a maleimide-activated carrier protein (Pierce Oiemical Co. #77107). 
300 pg of conjugated peptides were injected into chickens using Fteunds complete adjuvant 
The antisera were collected 5 weeks after initial immunization. Specific recognition by the 
30 antisera of commercial CTAP-ffl (23 pg. (Celsus Laboratories Inc., Cincinnati. Ohio; Cat #. 
41705), isolated heparanase (1.5 pg). and 10 pi of the platelet supernatant used for purification 
was achieved by separating the proteins on a reducing 18% polyacrylamide gel (Novex). 
transferring to nitroceUulose, and incubating with the pre-immune or antisera (1:1500). foUowed 
by incubation with a peroxidase labeled goat anti-chicken IgG (1:500; Kiericegaard and Perry) in 
35 the presence of PBS containing 5% dry milk and 0.05% Tween-20. Tb^ pie-immune sera did 
not recognize 7 - 10 kD proteins in the commercial CTAP-in. isolated heparanase. or platelet 
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supematants. 

Inhibition of heparanase activity by the C-teiminal synthetic peptide fSEQ ID NO; 42) or 
antisera; For expraiments designed to determine whether the peptide antisera was able to 
inhibit heparanase activity, the pre-immune and antisera were exchanged into 0.1 5M NaCl, 
5 O.OIM sodium phoshate buffer, pH 7.4 (PBS) using a 100 kD cut-off membrane in order to 
remove low molecular weight chicken heparanase normally present in the serum. Aliquots of 
isolated heparanase (15 ng) were pre-incubated for 30 min with 2 pi of either pte-immune or 
anti-CTAP-in antisera before adding the •'^S-HSPG to determine heparanase activity. In the 
presence of the pre-immune sera, the isolated protein had 14.3 ±0.1 units of heparanase activity, 

10 while in the presence of the C-terminal peptide antisera, only 0.8 ± 0.2 units of heparanase were 
detected (p < 0.(K)1; results confirmed in a second experiment). The N-terminal peptide 
antiserum was not able to neutralize the heparanase activity. Similar results woe obtained when 
the abihty of the synthetic p^tides to neutralize heparanase activity was examined. Heparanase 
assays conducted with 3 nM enzyme, 47 nM ^^S-HSPG substrate, and varying concentrations of 

15 peptides showed that heparanase activity was only 5% of control values in the presence of 250 
pM C-terminal p^tide. By contrast, h^aranase activity in the presence of 250 pM of either the 
N-terminal peptide or an unrelated peptide (PLALWAR) was 67% of control values. The ability 
of both the C-terminal peptide (SEQ ID NO: 42) or antisera from a chicken immunized with the 
C-terminal synthetic peptide to neutralize heparanase activity demonstrates conclusively that 

20 CTAP-in and NAP-2 possess h^aranase activity, and suggests that the C-teiminal region is 
essential for catalysis. Modeling of this domain (SEQ ID NO: 42) can be used m the 
identification of potent peptide-mimetic compoimds capable of inhibiting this enzyme activity. 

Computer assisted modeling can be acconq>lished using programs for automated docking 
of molecules within 3D databases, as described in DesJarlais, RX., Sheridan, RP., Seibel, G.L., 

25 Dixon, J.S., Kuntz, LD., Venkataraghavan, R., "Using shape compl^entarity as an initial screen 
in designing ligands for a reciter binding site of known three-dimensional structure"; J. Med. 
Chem. 31:722-729, 1988. Also, automated de novo construction of ligands that can bind the 
catalytic site as described in Moon, J.B., Howe, WJ., "Computer ctesign of bioactive molecules: 
a mefliod for leceptor-based de novo Ugand design"; Protems: Struct., Funcc, and Genetics, 

30 11:314-328, 1981. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Hcxjgewerf, Arlene J. 

Ledbetter, Steven R. 



(iii) NUMBER OF SEQUENCES: 4 3 

15 (iV) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: The Upjohn Company, Intellectual Property Law 

(B) STREET: 301 Henrietta 

(C) CITY: Kalamazoo 

(D) STATE: MI 

20 (E) COUNTRY: USA 

(F) ZIP: 49001 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
3d (A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: Jameson, William G. 

(B) REGISTRATION NUMBER: 27,199 

(C> REFERENCE/DOCKET NUMBER: 4731.1 CP 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: 616/385-7561 

(B) TELEFAX: 616/385-6897 

(C) TELEX: 224401 

45 (2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Asn Leu Ala Dys Gly :.-'s Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 
15 10 15 

Glu Leu Arg cys Met Cys lie Lys Thr Thr Ser Gly lie His Pro Lys 
20 25 30 

Asa lie Gin Ser Leu Glu Val lie Gly Lys Gly Thr His Cys Asn Gin 
35 40 45 

Val Glu Val lie Ala Thr Leu Lys Asp Gly Arg Lys lie Cys Leu Asp 
50 55 60 
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Pro Asp Ala Pro Arg lie Lys Lys lie Val Gin Lys Lys Leu Ala Gly 
65 70 75 80 

Asp Glu Ser Ala Asp 
85 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AACTTGGCGA AAGGCAAAGA GGAAAGTCTA GACAGTGACT TGTATGCTGA ACTCCGCTGC 60 

ATGTGTATAA AGACAACCTC TGGAATTCAT CCCAAAAACA TCCAAAGTTT GGAAGTGATC 120 

GGGAAAGGAA CCCATTGCAA CCAAGTCGAA GTGATAGCCA CACTGAAGGA TGGGAGGAAA 180 

ATCTGCCTGG ACCCAGATGC TCCCAGAATC AAGAAAATTG TACAGAAAAA ATTGGCAGGT 240 

GATGAATCTG CTGAT 255 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 
15 10 15 

Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys Met Cys lie 
20 25 30 

Lys Thr Thr Ser Gly lie His Pro Lys Asn lie Gin Ser Leu Glu Val 
35 40 45 

lie Gly Lys Gly Thr His Cys Asn Gin Val Glu Val lie Ala Thr Leu 
50 55 60 

Lys Asp Gly Arg Lys He Cys Leu Asp Pro Asp Ala Pro Arg He Lys 
65 70 75 80 

Lys He Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala Asp 
85 90 



( 2 ) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SHX2 ID NO: 4: 
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TCCTCCACCA AAGGACAAA.C TAAGAGAAAC TTGGCGAAAG GCAAAGAGGA AAGTCTAGAC 60 

AGTGACTTGT ATGCTGAACT CCGCTGCATG TGTATAAAGA CAACCTCTGG AATTCATCCC 120 

AAAAACATCC AAAGTTTGGA AGTGATCGGG AAAGGAACCC ATTGCAACCA AGTCGAAGTG 180 

ATAGCCACAC TGAAGGATGG GAGGAAAATC TGCCTGGACC CAGATGCTCC CAGAATCAAG 240 

AAAATTGTAC AGAAAAAATT GGCAGGTGAT GAATCTGCTG AT 282 
(2) INFORKATION FOR SEQ ID NO: 5: 

(i) SEODENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala Glu Leu Arg Cys 
15 10 15 

Met Cys He Lys Thr Thr Ser Gly He His Pro Lys Asn He Gin Ser 
20 25 30 

Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin Val Glu Val He 
35 40 45 

Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp Pro Asp Ala Pro 
50 55 60 

Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly Asp Glu Ser Ala 
65 70 75 80 

Asp 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGCAAAGAGG AAAGTCTAGA CAGTGACTTG TATGCTGAAC TCCGCTGCAT GTGTATAAAG 60 

ACAACCTCTG GAATTCATCC CAAAAACATC CAAAGTTTGG AAGTGATCGG GAAAGGAACC 120 

CATTGCAACC AAGTCGAAGT GATAGCCACA CTGAAGGATG GGAGGAS.r VT CTGCCTGGAC 180 

CCAGATGCTC CCAGAATCAA GAAAATTGTA CAGAAAAAAT TGGCAGGTGA TGAATCTGCT 240 

GAT 243 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu Leu Arg Cys Met Cys lie Lys Thr Thr Ser Gly He His Pro Lys 
15 10 15 

Asn He Gin Ser Leu Glu Val He Gly Lys Gly Thr His Cys Asn Gin 
20 25 30 

Val Glu Val He Ala Thr Leu Lys Asp Gly Arg Lys He Cys Leu Asp 
35 40 45 

Pro Asp Ala Pro Arg He Lys Lys He Val Gin Lys Lys Leu Ala Gly 
50 55 60 

Asp Glu Ser Ala Asp 
65 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQDENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GAACTCCGCT GCATGTGTAT AAAGACAACC TCTGGAATTC ATCCCAAAAA CATCCAAAGT 60 

TTGGAAGTGA TCGGGAAAGG AACCCATTGC AACCAAGTCG AAGTGATAGC CACACTGAAG 120 

GATG6GAGGA AAATCTGCCT GGACCCAGAT GCTCCCAGAA TCAAGAAAAT TGTACAGAAA 180 

AAATTGGCAG GTGATGAATC TGCTGAT 207 
(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQDENCE DESCRIPTION: SEQ ID NO: 9: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Ser Ser Thr Lys Gly Gin Thr Lys Arg Asn Leu Ala Lys Gly Lys Glu 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(Xi) SEODENCE DESCRIPTION: SEQ ID NO: 11: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Tyr Ala 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe 
15 10 15 

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu 
20 25 30 

Ala Glu Glu Asp Gly Asp Leu Gin Cys Leu Cys Val Lys Thr Thr Ser 
35 40 45 

Gin Val Arg Pro Arg His lie Thr Ser Leu Glu Val He Lys Ala Gly 
50 55 50 

Pro His Cys Pro Thr Ala Gin Leu He Ala Thr Leu Lys Asn Gly Arg 
65 70 75 80 

Lys lie Cys Leu Asp Leu Gin Ala Pro Leu Tyr Lys Lys He He Lys 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 13: 

55 (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
65 CCGCAGCATG AGCTCCGCAG CCGGGTTCTG CGCCTCACGC CCCGGGCTGC TGTTCCTGGG 60 
GTTGCTGCTC CTGCCACTTG TGGTCGCCTT CGCCAGCGCT GAAGCTGAAG AAGATGGGGA 120 
CCTGCAGTGC CTGTGTGTGA AGACCACCTC CCAGGTCCGT CCCAGGCACA TCACCAGCCT 180 
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GGAGGTGATC AAGGCCGGAC CCCACTGCCC CACTGCCCAA CTGATAGCCA CGCTGZ^GAA 240 

TGGAAGGAAA ATTTGCTTGG ACCTGCAAGC CCCGCTGTAC AAGAAAATAA TTAAGAAACT 300 

TTTGGAGAGT TAGCTACTAG CTGCCTACGT GTGTGCATTT GCTATATAGC ATACTTCTTT 360 

TTTCCAGTTT CAATCTAACT GTGAAAGAAA CTTCTGATAT TTGTGTTATC CTTATGATTT 420 

TAAATAAACA AAATAAATC 4 39 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: linear 

(ii) MOIiECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Asn Gin Thr Ala He Leu He Cys Cys Leu He Phe Leu Thr Leu 
1 5 10 15 

Ser Gly He Gin Gly Val Pro Leu Ser Arg Thr Val Arg Cys Thr Cys 
20 25 30 

He Ser He Ser Asn Gin Pro Val Asn Pro Val Asn Pro Arg Ser Leu 
35 40 45 

Glu Lys Leu Glu He He Pro Ala Ser Gin Phe Cys Pro Arg Val Glu 
50 55 60 

He He Ala Thr Met Lys Lys Lys Gly Glu Lys Arg Cys Leu Asn Pro 
65 70 75 80 

Glu Ser Lys Ala He Lys Asn Leu Leu Lys Ala Val Ser Lys Glu Met 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
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TJW^AACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA 650 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I/ENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: triJQ ID NO: 16: 

Met Ala Arg Ala Ala Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 
15 10 15 

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Gly Arg Arg Ala 
20 25 30 

Ala Gly Ala Ser Val Ala Thr Glu Leu Arg Cys Gin Cys Leu Gin Thr 
35 40 45 

Leu Gin Gly lie His Pro Lys Asn lie Gin Ser Val Asn Val Lys Ser 
50 55 60 

Pro Gly Pro His Cys Ala Gin Thr Glu Val lie Ala Thr Leu Lys Asn 
65 70 75 80 

Gly Arg Lys Ala Cys Leu Asn Pro Ala Ser Pro lie Val Lys Lys lie 
85 90 95 

Leu Asn Ser i 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTCGCCAGCT CTTCCGCTCC TCTCACAGCC GCCAGACCCG CCTGCTGAGC CCCATGGCCC 60 

GCGCTGCTCT CTCCGCCGCC CCCAGCAATC CCCGGCTCCT GCGAGTGGCA CTGCTGCTCC 120 

TGCTCCTGGT AGCCGCTGGC CGGCGCGCAG CAGGAGCGTC CGTGGCCACT GAACTGCGCT 180 

GCCAGTGCTT GCAGACCCTG CAGGGAATTC ACCCCAAGAA CATCCAAAGT GTGAACGTGA 240 

AGTCCCCCGG ACCCCACTGC GCCCAAACCG AAGTCATAGC CACACTCAAG AATGGGCGGA 300 

AAGCTTGCCT CAATCCTGCA TCCCCCATAG TTAAGAAAAT CATCGAAAAG ATGCTGAACA 360 
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GTGACAAATC CAACTGACCA GAAGGGAGGA GGAAGCTCAC TGGTGGCTGT TCCTGAAGGA 420 

GGCCCTGCCC TTATAGGAAC AGAAGAGGAA AGAGAGACAC AGCTGCAGAG GCCACCTGGA 480 

TTGTGCCTAA TGTGTTTGAG CATCGCTTAG GAGAAGTCTT CTATTTATTT ATTTATTCAT 540 

TAGTTTTGAA GATTCTATGT TAATATTTTA GGTGTAAAAT AATTAAGGGT ATGATTAACT 600 

CTACCTGCAC ACTGTCCTAT TATATTCATT CTTTTTGAAA TGTCAACCCC AAGTTAGTTC 660 

AATCTG6ATT CATATTTAAT TTGAAGGTAG AATGTTTTCA AATGTTCTCC AGTCATTATG 720 

TTAATATTTC TGAGGAGCCT GCAACATGCC AGCCACTGTG ATAGAGGCTG GCGGATCCAA 780 

GCAAATGGCC AATGAGATCA TTGTGAAGGC AGGGGAATGT ATGTGCACAT CTGTTTTGTA 840 

ACTGTTTAGA TGAATGTCAG TTGTTATTTA TTGAAATGAT TTCACAGTGT GTGGTCAACA 900 

TTTCTCATGT TGAAACTTTA AGAACTAAAA TGTTCTAAAT ATCCCTTGGA CATTTTATGT 960 

CTTTCTTGTA AGGCATACTG CCTTGTTTAA TGGTAGTTTT ACAGTGTTTC TGGCTTAGAA 1020 

CAAAGGGGCT TAATTATTGA TGTTTTCGGA 1050 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: suaino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: peptide 

(Xt) SEQOENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ala Arg Ala Thr Leu ser Ala Ala Pro ser Asn Pro Arg Leu Leu 
15 10 15 

Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala 
20 25 30 

Ala Gly Ala Pro Lys Ala Thr Glu Lys Arg Cys Gin Cys Lys Gin Thr 
35 40 45 

Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val Lys Val Lys Ser 
50 55 60 

Pro Gly Pro His Cys Ala Gin Thr Glu Val lie Ala Thr Leu Lys Asn 
65 70 75 80 

Gly Gin Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Lys Lys lie 
85 90 95 



60 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1081 base pairs 
(B) TYPE: nucleic acid 
65 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTCTCCTCCT CGCACAGCCG CTCGAACCGC CTGCTGAGCC CCATGGCCCG CGCCACGCTC 60 

TCCGCCGCCC CCAGCAATCC CCGGCTCCTG CGGGTGGCGC TGCTGCTCCT GCTCCTGGTG 120 

GCCGCCAGCC GGCGCGCAGC AGGAGCGCCC CTGGCCACTG AACTGCGCTG CCAGTGCTTG 180 

CAGACCCTGC AGGGAATTCA CCTCAAGAAC ATCCAAAGTG TGAAGGTGAA GTCCCCCGGA 240 

CCCCACTGCG CCCAAACCGA AGTCATAGCC ACACTCAAGA ATGGGCAGAA AGCTTGTCTC 300 

AACCCCGCAT CGCCCATGGT TAAGAAAATC ATCGAAAAGA TGCTGAAAAA TGGCAAATCC 360 

AACTGACCAG AAGGAAGGAG GAAGCTTATT GGTGGCTGTT CCTGAAGGAG GCCCTGCCCT 420 

TACAGGAACA GAAGAGGAAA GAGAGACACA GCTGCAGAGG CCACCTGGAT TGCGCCTAAT 480 

GTGTTTGAGC ATCACTTAGG AGAAGTCTTC TATTTATTTA TTTATTTATT TATTTGTTTG 54 0 

TTTTAGAAGA TTCTATGTTA ATATTTTATG TGTAAAATAA GGTTATGATT GAATCTACTT 600 

GCACACTCTC CCATTATATT TATTGTTTAT TTTAGGTCAA ACCCAAGTTA GTTCAATCCT 660 

GATTCATATT TAATTTGAAG ATAGAAGGTT TGCAGATATT CTCTAGTCAT TTGTTAATAT 720 

TTCTTCGTGA TGACATATCA CATGTCAGCC ACTGTGATAG AGGCTGAGGA ATCCAAGAAA 780 

ATGGCCAGTG AGATCAATGT GACGGCAGGG AAATGTATGT GTGTCTATTT TGTAACTGTA 840 

AAGATGAATG TCAGTTGTTA TTTATTGAAA TGATTTCACA GTGTGTGGTC AACATTTCTC 900 

ATGTTGAAGC TTTAAGAACT AAAATGTTCT AAATATCCCT TGGACATTTT ATGTCTTTCT 960 

TGTAAGGCAT ACTGCCTTGT TTAATGTTAA TTATGCAGTG TTTCCCTCTG TGTTAGAGCA 1020 

GAGAGGTTTC GATATTTATT GATGTTTTCA CAAAGAACAG GAAAATAAAA TATTTAAAJ^ 1080 

T 1081 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(a) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECaLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 
15 10 15 

Arg val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala 
20 25 30 

Ala Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gin Cys Leu Gin Thr 
"35 40 45 

Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val Asn Val Arg Ser 
50 55 60 

Pro Gly Pro His Cys Ala Gin Thr Glu Val lie Ala Thr Leu Lys Asn 
65 70 75 80 
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Gly Lys Lys Ala Cys Leu Asn Pro Ala Ser Pro Met Val Gin Lys lie 
85 90 95 

lie Glu Lys He Leu Asn Lys Gly Ser Thr Asn 
5 100 105 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SBQaEMCE CHARACTERISTICS: 
10 (A) LENGTH: 988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(xi) SEQ0ENCE DESCRIPTION: SEQ ID NO: 21: 
CTCGCACAGC TTCCCGACGC GTCTGCTGAG CCCCATGGCC CACGCCACGC TCTCCGCCGC 60 

20 

CCCCAGCAAT CCCCGGCTCC TGCGGGTGGC GCTGCTGCTC CTGCTCCTGG TGGCCGCCAG 120 
CCGGCGCGCA GCAGGAGCGT CCGTGGTCAC TGAACTGCGC TGCCAGTGCT TGCAGACACT 180 
25 GCAGGGAATT CACCTCAAGA ACATCCAAAG TGTGAATGTA AGGTCCCCCG GACCCCACTG 240 
CGCCCAAACC GAAGTCATAG CCACACTCAA GAATGGGAAG AAAGCTTGTC TCAACCCCGC 300 
ATCCCCCATG GTTCAGAAAA TCATCGAAAA GATACTGAAC AAGGGGAGCA CCAACTGACA 360 

30 

GGAGAGAAGT AAGAAGCTTA TCAGCGTATC ATTGACACTT CCTGCAGGGT GGTCCCTGCC 420 
CTTACCAGAG CTGAAAATGA AAAAGAGAAC AGCAGCTTTC TAGGGACAGC TGGAAAGGAC 480 
35 TTAATGTGTT TGACTATTTC TTACGAGGGT TCTACTTATT TATGTATTTA TTTTTGAAAG 540 
CTTGTATTTT AATATTTTAC ATGCTGTTAT TTAAAGATGT GAGTGTGTTT CATCAAACAT 600 
AGCTCAGTCC TGATTATTTA ATTGGAATAT GATGGGTTTT AAATGTGTCA TTAAACTAAT 660 

40 

ATTTAGTGGG AGACCATAAT GTGTCAGCCA CCTTGATAAA TGACAGGGTG GGGAACTGGA 720 
GGGTGGGGGG ATTGAAATGC AAGCAATTAG TGGATCACTG TTAGGGTAAG GGAATGTATG 780 
45 TACACATCTA TTTTTTATAC TTTTTTTTTA AAAAAAGAAT GTCAGTTGTT ATTTATTCAA 840 
ATTATCTCAC ATTATGTGTT CAACATTTTT ATGCTGAAGT TTCCCTTAGA CATTTTATGT 900 
CTTGCTTGTA GGGCATAATG CCTTGTTTAA TGTCCATTCT GCAGCGTTTC TCTTTCCCTT 960 

50 

GGAAAAGAGA ATTTATCATT ACTGTTAC 988 
(2) INFORMATION FOR SEQ ID NO: 22: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Thr Ser Lys Leu Ala Val Ala Leu X^u Ala Ala Phe Leu He ! 
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Ala Ala Leu Cys Glu Gly Ala Val Leu Pro Arg Ser Ala Lys Glu Leu 
20 25 30 

Arg Cys Gin Cys He Lys Thr Tyr Ser Lys Pro Phe His Pro Lys Phe 
35 40 45 

lie Lys Glu Leu Arg Val He Glu Ser Gly Pro His Cys Ala Asn Thr 
50 55 60 

Glu He He Val Lys Leu Ser Asp Gly Arg Glu Leu Cys Leu Asp Pro 
65 70 75 80 

Lys Glu Asn Trp Val Gin Arg Val Val Glu Lys Phe Leu Lys Arg Ala 
85 90 95 

Glu 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQDEa*CE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATGACTTCCA AGCTGGCCGT GGCTCTCTTG GCAGCCTTCC TGATTTCTGC AGCTCTGTGT 60 

GAAGGTGCAG TTTTGCCAAG GAGTGCTAAA GAACTTAGAT GTCAGTGCAT AAAGACATAC 120 

TCCAAACCTT TCCACCCCAA ATTTATCAAA GAACTGAGAG TGATTGAGAG TGGACCACAC 180 

TGCGCCAACA CAGAAATTAT TGTAAAGCTT TCTGATGGAA GAGAGCTCTG TCTGGACCCC 240 

AAGGAAAACT GGGTGCAGAG GGTTGTGGAG AAGTTTTTGA AGAGGGCTGA G 291 
(2) INFORMATION FOR SEQ ID NO: 24: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala Gly Pro Ala Ala Ala Val Leu Arg Glu Lys Arg Cys Val Cys Leu 
1 5 10 15 , 

Gin Thr Thr Gin Gly Val His Pro Lys Met He Ser Asn Leu Gin Val 
20 25 30 

Phe Ala He Gly Pro Gin Cys Ser Lys Val Glu Val Val Ala Ser Leu 
35 40 45 

Lys Asn Gly Lys Glu He Cys Leu Asp Pro Glu Ala Pro Phe Leu Lys 
SO 55 60 

Lys Val He Gin Lys He Leu Asp Gly Gly Asn Lys Glu Asn 
65 70 75 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOIiOGy: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GTGTTGCGGG AACTGCGGTG CGTGTGTTTA CAGACCACGC AGGGAGTTCA TCCCAAAATG 60 

ATCAGTAATC TGCAAGTGTT CGCCATAGGC CCACAGTGCT CCAAGGTGGA AGTGGTAGCC 120 

TCCCTGAAGA ACGGGAAGGA AATTTGTCTT GATCCAGAAG CCCCTTTTCT AAAGAAAGTC 180 

ATCCAGAAAA TCCTCGACGG CGGCAACAAA GAAAAC 216 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Gin Val Ser Thr Ala Ala Leu Ala Val Leu Leu Cys Tlir Met Ala 
15 10 15 

Leu Cys AsD Gin Val Leu Ser Ala Pro Leu Ala Ala Asp Thr Pro Thr 
20 25 30 

Ala Cys Cys Phe Ser Tyr Thr Ser Arg Gin lie Pro Gin Asn Phe lie 
35 40 45 

Ala Asp Tyr Phe Glu Thr Ser Ser Gin Cys Ser Lys Pro Ser Val lie 
50 55 60 

Phe Leu Thr Lys Arg Gly Arg Gin Val Cys Ala Asp Pro Ser Glu Glu 
65 70 75 80 

Trp Val Gin Lys Tyr Val Ser Asp Leu Glu Leu Ser Ala 
85 90 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

65 GAATTCAAGG CCTGTCCTGG TTTGGTCCCA ATTTACCTTT ATCATCCATA TTCACCCCCA 60 

CTGCTCTGCA GCTCCACTGA AGCACCCCCT CTTTCCTCTG AGCCACAATG TCACACCCAG 120 

GACTCTGCCT CAGCTGGGCC TCCACTGCCC ACCCATCTAT AGATGCCTAA ATCCCGGGCA 180 



wo 95/04158 PCTAUS94/08207 

-45- 

GTTATCCAGA CACAACTAAA GTTCCATCCC TTCCATGAAG CCTTCCCCAA CCCTCTGGTG 240 

GAAGGTCACT TCTTCCTCAT GGGGTTCTGA GCTTTCATTT CTTTTTCTAC TAAGAGTTTT 300 

5 ACAATTACCT GTTCATACAC TCTACCTGCC CCCATGAGAC CAGGGGCATC TCAGAAACAA 360 

AGATCATTAA AACCAACTAA ATCTATTTCT CATTATAAAA TGAGATATGC TGATTGATTG 420 

CAAAATAATA AAATAACAAA GTATGGAAAA GAAAAAAAAA AGCATATAAT CTGGCTGAGA 480 

10 

AGGTAGAGAC CCTTCCACAC CACTGAAATT ATGTGTTGAA AAGAATAAGG AAAAAACTGC 540 

TTCAGTTTGG CATTATTTAr GTAAGTATAG TATAGGATCC TTAAAATGGT TCAAAGAAAT 600 

15 GGGAAATCAA GACTTCATTT TGGCAAAGCC ATTGAACAGA AACTGTAGCA TATTTATCAG 660 

TAATTTCTTT CAGATTAAAC AACTGACAAC AACCCACTTT TCAACCAGTG ATGTTGGAAA 720 

TGTTTTAAAA CAAAATTAGT TCATAAATTT GTGGGTTGAC CAAGAAGGTA ATAAAGTCTC 780 

20 

ACTAAATAAA ATGAGGAAAA TTCAGAAAAA GAAAAAAATA AGAAAATAAA TCACCCATGG 840 

ATCTAAGCAC TATTCATTCT TTAAGGCATG TATTTCCAAG CCTTTTAATT TTTTCATGCC 900 

25 TAGAGTTGGC ATGGCATATA TATATCTTTA TACAATTCTT CAAATTTTAT AGAATTTGTA 960 

TAATGTTTTA TCTTGCTTTT TTTTTAACCA CTGATGTTAT AAGCATATTT ATGCCACTTC 1020 

ATTCACGTTA GAGACTTAAT AATAAAGGAT CTTGTGGATA ATTTATCATT CCCTGATAGA 1080 

30 

GAAAAATTTA GCTTTGCTTA TTTTAGAGTT ATAAATGATG CTGGGTCAGG TATCTTTATG 1140 

TTTGAAGATG GCTCCATATT TGGGTTGXTT CCACAGAACT CTTTCCAGAA AXGCTTTTTC 1200 

35 TAGGTTAATG GCTACACATA TTTCTAGGCA CCTGACATAC TGACACCCAC CTCTAAAGTA 1260 

TTTTTATGAT CCACAACTAG CGTTTAACAC AGCGCCCC?G TCACTCCGAG ACTAATAAAT 1320 

AGACAAATGA CTGAAACGTG ACCTCATGCT TTCTATTCCT CCAGCTTTCA TTGAGTTCCT 1380 

40 

TTCCTCTGGG AGGACTGGGG GTTGTCTAGC CCTCCACAGC ATCAGCCCAT TGACCCTATC 1440 

CTTGTGGTTA TAGCAGCTGA GGAAGCAGAA TTACAGCTCT GTGGGAAGGA ATGGGGCTGG 1500 

45 AGAGTTCATG CATAGACCAA TTCTTTTTTT tTTTTTTTTT TGAGATGGAG TTTCACTTTT 1560 

GITGCCCAGG CTGGAGTGCA ATGGCATGAT CTCAGCTCAC CACAGCCCCC ACCTCCIGGG 1620 

TTCAAGCGAT TCTCCTGCCC TCAGCCTCCC GAGTAGCTGG GATTACAGGC ATGTGCCACC 1680 

50 

ACGCCTGACT ACTTTTGTAT TTTTAGTAGA GATGGAGTTT CTCTTTCTTG GTCAGGTT6G 1740 

TCTCAAACTC CTGACCTCAG GTGATCTGCA GCCTCGGCCT CCAAAGTGTT GGGATTACAG 1800 

55 GTGTGAGCGA CCATGCCTGG CTGCATAGAC CAGTTCTTAT GAGAAGGGAT CAACTAAGAA 1860 

TAGCCTTGGG TTGACACACA CCCCTCTXCA CACTCACAGG AGAAACCCCA TGAAGCTAGA 19 "0 

ACCAGTCATG AGTTGAGAGC TGAGAGTTAG AGAGTAGCTC AGAGATGCTA TTCTTGGATA 19i:0 

60 

TCCTGAGCCC CTGTGGTCAC CAGGGACCCT GAGTTGTGCA ACACTCAGCA TGACAGCATC 2040 

ACTACACTTA AAAATTTCCC TCCTCACCCC CAGATTCCAT TTCCCCATCC GCCAGGGCTG 2100 

65 CCTATAAAGA GGAGAGATGG CTTCAGACAT CAGAAGGACG CAGGCAGCAA AGAGTAGTCA 2160 

GTCCCTTCTT GGCTCTGCTG ACACTCGAGC CCACATTCCA TCACCTGCTC CCAATCATGC 2220 

AGGTCTCCAC TGCTGCCCTT GCCGTCCTCC TCTGCACCAT GGCTCTCTGC AACCAGGTCC 2280 
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TCTCTGCACC ACGTGAGTCC ATGTTGTTGT TGTGGGTATC ACCACTCTCT GGCCATGGTT 234 0 

AGACCACATC AGTCTTTTTT TGTGGCGTGA GAGGCCCCGA AGAGAAAAGA AGGAAGTTCT 24 00 

5 TAAAGCGCTG CCAAACACCT TGGTCTTTTT CTTCACAACT TTTATTTTTA TCTCTAGAAG 24 60 

GGGTCTTAGC CCTCCTAGTC TCCAGGTATG AGAATCTAGG CAGGGGCAGG GGAGTTACAG 2520 

TCCCTTGTAC AGATAGAAAA ACAGGGTTCA AAACGAATCA GTTTGCAAGA GGCAGAATCC 2580 

10 

AGGGCTGCTT ACTTCCCAGT GGGGTCXGTT CTTCACTCTC CAGCTCACCC TAGTCTCCCA 2640 

GGAGCCCTGT CCCTTGGATG TCTTATGAGA GATGTCCAGG GCTTCTCTTG GGCTGGGGTA 2700 

15 TGACTTCTTG AACCGACAAA ATTCCATGAA GAGAGCTAAG AGAACAGTCC ATTCAGGTAT 2760 

CTGGATCACA TAGAGAAACA GAGAACCCAC TATGAAGAGT CAAGGGGAAA GAGGAATATA 2820 

GACAGAAACA AAGAGACATT TCTCTGCAAA ACCCCCCAAA TGCCTTGCAG TCACTTGGTC 2880 

20 

TGAGCAAGCC TGCCCTCCTC AACCACTCAG GGATCAGAAG CTGCCTGGCC TTTTCTTCTG 2940 

AGCTGTGACT TGGGCTTATT CTCTCCTTTC TCCGCAGTTG CTGCTGACAC GCCGACCGCC 3000 

25 TGCTGCTTCA GCTACACCTC CCGACAGATT CCACAGAATT TCATAGCTGA CTACTTTGAG 3060 

ACGAGCAGCC AGTGCTCCAA GCCCAGTGTC ATGTAAGTGC CAGTCTTCCT GCTCACCTCT 3120 

AGGGAGGTAG GGAGTGTCAG GGTGGGGGCA GAAACAGGCC AGAAGGCCAT CCTGGAAAGG 3180 

30 

CCCAGCCTTC AGGAGCCTAT CGGGGATACA GGACGCAGGG CACTGAGGTG TGACCTGACT 3240 

TGGGGCTGGA GTGAGGTGGG TGTTACAGAG TCAGGAAGGG CTGCCCCAGG CCAGAGGAAA 3300 

35 GGGACAGGAA GAAGGAGGCA GCAGGACACT CTGAGGGCCC CCTTGCCTGG AGTCACTGAG 3360 

AGAAGCTCTC TAGACGGAGA TAGGCAGGGG GCCCCTGAGA GAGGAGCAGG CCTTGAGCTG 3420 

CCCAGGACAG AGAGCAGGAT GTCAGGGCCA TGGTGGGCCC AGGATTCCCC GGCTGGATXC 3480 

40 

CCCAGTGCTT AACTCTTCCT CCCTTCTCCA CAGCTTCCTA ACCAAGAGAG GCCGGCAGGT 3540 

CTGTGCTGAC CCCAGTGAGG AGTGGGTCCA GAAATACGTC AGTGACCTGG AGCTGAGTGC 3600 

45 CTGAGGGGTC CAGAAGCTTC GAGGCCCAGC GACCTCAGTG GGCCCAGTGG GGAGGAGCAG 3660 

GAGCCTGAGC CTTGGGAACA TGCGTGTGAC CTCCACAGCT ACCTCTTCTA TGGACTGGTT 3720 

ATTGCCAAAC AGCCACACTG TGGGACTCTT CTTAACTTAA ATTTTAATTT ATTTATACTA 3780 

50 

TTTAGTTTTT ATAATTTATT TTTGATTTCA CAGTGTGTTT GTGATTGTTT GCTCTGAGAG 3840 

TTCCCCCTGT CCCCTCCACC TXCCCTCACA GTGTGTCTGG TGACAACCGA GTGGCTGTCA 3900 

55 TCGGCCTGTG TAGGCAGTCA TGGCACCAAA GCCACCAGAC TGACAAATGT GTATCAGATG 3960 

CTTTTGTTCA GGGCTGTGAT CGGCCTGGGG AAATAATAAA GATGTTCTTT TAAACGGTAA 4020 

ACCAGTATTG AGTTTGGTTT TGTTTTTCTG GCAAATCAAA ATCACTGGTT AAGA(?GAATC 4080 

60 

ATAGGCAAAG ATTAGGAAGA GGTGAAATGG AGGGAAATTG GGAGAGATGG GGAGCGCXGC 4140 

GACAGAGTTA TCCACTTCAC AAAATTCTGG AACATTGAAA CTACGAATAT GTTATAACTC 4200 

65 AAATCGTAAT ATGCACGCTC TAGGAGAATT AACTACTTGA ATGGCCACCA TTAAGCAGAG 4260 

TATTCTGTAG GGCATATTCA TGATGAATCA AGCTCTTAAT AGCAATTATT TACATTGTTG 4320 

AGGCTTACTC CTCCTACTGA GTGCTTTTTA TACATTGTTG ATTTAATCTT ACCAATGCAA 4 380 
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TAGTACAGCT TAGGTACTAT TAATACCTCC ACTTGACAGA AAAGTAACCC AGGGCTCAGA 4440 

AAGGTTAGAC AACTTGGCTG AGGTTACACA GCACGTAAAC GGTCAATTGT GTTCCAAAAC 4500 

5 TGGACTTTTA TTGAACTACA GACTATGCTG TTAACCATTG ACCAAGTTAT TTCCCAAAGT 4560 

ATGACCCGCC TATACTCAAA TCTTACCCCA TTCTTTAACA GATGATACTT TATCCATTGC 4620 

AACCACTTCC TGTCAGGATT CTGAGTTGAC ATAGAGTGTT TCAGCAGTGA TTATTTAAGC 4680 

10 

CAATTACATC AGGATCTTTA GGTGTAGACC TGGGAACTGA TATTTTTATC AAGCTCATGA 4740 

GGTGTTCCAT AGCATGTTAA TGACTGAGAG CCACTGTCAA TAGAATTC 4788 
15 (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 cunino acids 

(B) TYPE: amino acid 

20 <C) STRANDEnSNESS : single 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu Val Ala Ala 
30 1 5 10 15 

Phe Cys Ser Pro Ala Leu Ser Ala Pro Met Gly Ser Asp Pro Pro Thr 
20 25 30 

35 Ala Cys Cys Phe Ser Tyr Thr Ala Arg Lys Leu Pro Arg Asn Phe Val 

35 40 45 

Val Asp Tyr Tyr Glu Thr Ser Ser Leu Cys Ser Gin Pro Ala Val Val 
50 55 60 

40 

Phe Gin Thr Lys Arg Ser Lys Gin Val Cys Ala Asp Pro Ser Glu Ser 
65 70 75 80 

Trp Val Gin Glu Tyr Val Tyr Asp Leu Glu Leu Asn 
45 85 90 

(2) INFORMATION FOR SEQ ID NO: 29: 

{ i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTCCCCCCCC CCCCCCCCCC CCCCGCCCGA GCACAGGACA CAGCTGGGTT CTGAAGCTTC 60 

60 

TGAGTTCTGC AGCCTCACCT CTGAGAAAAC CTCTTTTCCA CCAATACCAT GAAGCTCTGC 120 
GTGACTGTCC TGTCTCTCCT CATGCTAGTA GCTGCCTTCT GCTCTCCAGC GCTCTCAGCA 180 
65 CCAATGGGCT CAGACCCTCC CACCGCCTGC TGCTTTTCTT ACACCGCGAG GAAGCTTCCT 24 0 

CGCAACTTTG TGGTAGATTA CTATGAGACC AGCAGCCTCT GCTCCCAGCC AGCTGTGGTA 300 
TTCCAAACCA AAAGAAGCAA GCAAGTCTGT GCTGATCCCA GTGAATCCTG GGTCCAGGAG 360 
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TACGTGTATG ACCTGGAACT GAACTGAGCT GCTCAGAGAC AGGAAGTCTT CAGGGAAGGT 420 

CACCTGAGCC CGGATGCTTC TCCATGAGAC ACATCTCCTC CATACTCAGG ACTCCTCTCC 480 

5 GCAGTTCCTG TCCCTTCTCT TAATTTAATC TTTTTTATGT GCCGTGTTAT TGTATTAGGT 54 0 

GTCATTTCCA TTATTTATAT TAGTTTAGCC AAAGGATAAG TGTCCTATGG GGATGGTCCA 600 

CTGTCACTGT TTCTCTGCTG TTGCAAATAC ATGGATAACA CATTTGATTC TGTGTGTTTT 660 

10 

CCATAATAAA ACTTTAAAAT AAAATGCAGA CAGTTA 696 
(2) INFORHATION FOR SEQ ID NO: 30: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: cUBino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: peptide 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Gin lie lie Thr Thr Ala Leu Val Cys Leu Leu Leu Ala Gly Met 
15 10 15 

30 Trp Pro Glu Asp Val Asp Ser Lys Ser Met Gin Val Pro Phe Ser Arg 

20 25 30 

Cys Cys Phe Ser Phe Ala Glu Gin Glu lie Pro Leu Arg Ala He Leu 
35 40 45 

35 

Cys Tyr Arg Asn Thr Ser Ser He Cys Ser Asn Glu Gly Leu He Phe 
50 55 60 

Lys Leu Lys Arg Gly Lys Glu Ala Cys Ala Leu Asp Thr Val Gly Trp 
40 65 70 75 80 

Val Gin Arg His Arg Lys Met Leu Arg His Cys Pro Ser Lys Arg Lys 
85 90 95 

45 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 
SO (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 31: 
ACCAGGCTCA TCAAAGCTGC TCCAGGAAGG CCCAAGCCAG ACCAGAAGAC ATGCAGATCA 60 
60 TCACCACAGC CCTGGTGTGC TTGCTGCTAG CTGGGATGTG GCCGGAAGAT GTGGACAGCA 120 
AGAGCATGCA GGTACCCTTC TCCAGATGTT GCTTCTCATT TGCGGAGCAA GAGATTCCCC 180 
TGAGGGCAAT CCTGTGTTAC AGAAATACCA GCTCCATCTG CTCCAATGAG GGCTTAATAT 24 0 

65 

TCAAGCTGAA GAGAGGCAAA GAGGCCTGCG CCTTGGACAC AGTTGGATGG GTTCAGAGGC 300 
ACAGAAAAAT GCTGAGGCAC TGCCCGTCAA AAAGAAAATG AGCAGATTTC TTTCCATTGT 360 
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GGGCTCTGGA AACCACATGG CTTCACCTGT CCCCGAAACT ACCAGCCCTA CACCATTCCT 420 

TCTGCCCTGC TTTTGCTAGG TCACAGAGGA TCTGCTTGGT CTTGATAAGC TATGTTGTTG 480 

5 CACTTTAAAC ATTTAAATTA TACAATCATC AACCCCCAAC 520 

(2) INFORMATION FOR SEQ ID NO: 32: 

<i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOUX3Y: linear 

15 (ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

20 

Met Lys Val Ser Ala Ala Leu Leu Cys Leu Leu Leu lie Ala Ala Thr 
15 10 15 

Phe lie Pro Gin Gly Lys Ala Gin Pro Asp Ala He Asn Ala Pro Val 
25 20 25 30 

Thr Cys Cys Tyr Asn Phe Thr Asn Arg Lys He Ser Val Gin Arg Leu 
35 40 45 

30 Ala Ser Tyr Arg Arg He Thr Ser Ser Lys Cys Pro Lys Glu Ala Val 

50 55 60 

He Phe Lys Thr He Val Ala Lys Glu He Cys Ala Asp Pro Lys Gin 
65 70 75 80 

35 

Lys Trp Val Gin Asp Ser Met Asp His Leu Asp Lys Gin Thr Gin Thr 
85 90 95 

Pro Lys Thr 

40 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 725 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTAACCCAGA AACATCCAAT TCTCAAACTG AAGCTCGCAC TCTCGCCTCC AGCATGAAAG 60 

55 

TCTCTGCCGC CCTTCTGTGC CTGCTGCTCA TAGCAGCCAC CTTCATTCCC CAAGGGCTCG 120 
CTCAGCCAGA TGCAATCAAT GCCCCAGTCA CCTGCTGTTA TAACTTCACC AATAGGAAGA 180 
60 TCTCAGTGCA GAGGCTCGCG AGCTATAGAA GAATCACCAG CAGCAAGTGT CCCAAAGAAG 240 
CTGTGATCTT CAAGACCATT GTGGCCAAGG AGATCTGTGC TGACCCCAAG CAGAAGTGGG 300 
TTCAGGATTC CATGGACCAC CTGGACAAGC AAACCCAAAC TCCGAAGACT TGAACACTCA 360 

65 

CTCCACAACC CAAGAATCTG CAGCTAACTT ATTTTCCCCT AGCTTTCCCC AGACACCCTG 420 
TTTTATTTTA TTATAATGAA TTTTGTTTGT TGATGTGAAA CATTATGCCT TAAGTAATGT 480 
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TAATTCTTAT TTAAGTTATT GATGTTTTAA GTTTATCTTT CATGGTACTA GTGTTTTTTA 54 0 

GATACAGAGA CTTGGGGAAA TTGCTTTTCC TCTTGAACCA CAGTTCTACC CCTGGGATGT 600 

5 TTTGAGGGTC TTTGCAAGAA TCATTAATAC AAAGAATTTT TTTTAACATT CCAATGCATT 660 

GCTAAAATAT TATT.GTGGAA ATGAATATTT TGTAACTATT ACACCAAATA AATATATTTT 720 

TGTAC 725 

10 

(2) IKFORMAXION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS :. single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 ■ 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

25 Met Lys Ala Ser Ala Ala Leu Leu Cys Leu Leu Leu Thr Ala Ala Ala 

15 10 15 

Phe Ser Pro Gin Gly Leu Ala Gin Pro Val Gly lie Asn Thr Ser Thr 
20 25 30 

30 

Thr Cys Cys Tyr Arg Phe He Asn Lys Lys He Pro Lys Gin Arg Leu 
35 40 45 

Glu Ser Tyr Arg Arg Thr Thr Ser Ser His Cys Pro Arg Glu Ala Val 
35 50 55 60 

He Phe Lys Thr Lys Leu Asp Lys Glu He Cys Ala Asp Pro Thr Gin 
65 70 75 80 

40 Lys Trp Val Gin Asp Phe Met Lys His Leu Asp Lys Lys Thr Gin Thr 

85 90 95 

Pro Lys Leu 

45 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AGCAGAGGGG CTGAGACCAA ACCAGAAACC TCCAATTCTC ATGTGGAAGC CCATGCCCTC 60 
60 ACCCTCCAAC ATGAAAGCCT CTGCAGCACT TCTGTGTCTG CTGCTCACAG CAGCTGCTTT 120 
CAGCCCCCAG GGGCTTGCTC AGCCAGTTGG GATTAATACT TCAACTACCT GCTGCTACAG 180 
ATTTATCAAT AAGAAAATCC CTAAGCAGAG GCTGGAGAGC TACAGAAGGA CCACCAGTAG 240 

65 

CCACTGTCCC CGGGAAGCTG TAATCTTCAA GACCAAACTG GACAAGGAGA TCTGTGCTGA 300 
CCCCACACAG AAGTGGGTCC AGGACTTTAT GAAGCACCTG GACAAGAAAA CCCAAACTCC 360 
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AAAGCTTTGA ACATTCATGA CTGAACTAAA AACAAGCCAT GACTTGAGAA ACAAATAATT 420 

TGTATACCCT GTCCTTTCTC AGAGTGGTTC TGAGATTATT TTAATCTAAT TCTA;<;GAAT 480 

5 ATGAGCTTTA TGTAATAATG TGAATCATGG TTTTTCTTAG TAGATTTTAA AAGTTATTAA 540 

TATTTTAATT TAATCTTCCA TGGATTTTGG TGGGTTTTGA ACATAAAGCC TTGGATGTAT 600 

ATGTCATCTC AGTGCTGTAA AAACTGTGGG ATGCTCCTCC CTTCTCTACC TCATGGGGGT 660 

10 

ATTGTATAAG TCCTTGCAAG AATCAGTGCA AAGATTTGCT TTAATTGTTA AGATATGATG 720 

TCCCTATGGA AGCATATTGT TATTATATAA TTACATATTT GCATATGTAT GACTCCCAAA 780 

15 TTTTCACATA AAATAGATTT TTGTAAAAAA 810 

(2) INFORMATION FOR SEQ ID NO: 36: 

(t) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (ii) MOLECDLE TYPE: peptide 



(xi) SEODENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Val Ser Ala Ala Arg Leu Ala Val lie Leu lie Ala Thr Ala 
15 10 15 

Leu Cys Ala Pro Ala Ser Ala Ser Pro Tyr Ser Ser Asp Thr Thr Pro 
20 25 30 

Cys Cys Phe Ala Tyr He Ala Arg Pro Leu Pro fug Ala His He Lys 
35 40 45 

Glu Tyr Phe Tyr Thr Ser Gly Lys Cys Ser Asn Pro Ala Val Val Phe 
50 55 60 

Val Thr Arg Lys Asn Arg Gin Val Cys Ala Asn Pro Glu Lys Lys Trp 
65 70 75 80 

Val Arg Glu Tyr He Asn Ser Leu Glu Met Ser 
85 90 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

60 

CCTCCGACAG CCTCTCCACA GGTACCATGA AGGTCTCCGC GGCACGCCTC GCTGTCATCC 60 

TCATTGCTAC TGCCCTCTGC GCTCCTGCAT CTGCCTCCCC ATATTCCTCG GACACCACAC 120 

65 CCTGCTGCTT TGCCTACATT GCCCGCCCAC TGCCCCGTGC CCACATCAAG GAGTATTTCT 180 

ACACCAGTGG CAAGTGCTCC AACCCAGCAG TCGTCTTTGT CACCCGAAAG AACCGCCAAG 240 

TGTGTGCCAA CCCAGAGAAG AAATGGGTTC GGGAGTACAT CAACTCTTTG GAGATGAGCT 300 
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AGGATGGAGA GTCCTTGAAC CTGAACTTAC ACAAATTTGC CTGTTTCTGC TTGCTCTTGT 360 

CCTAGCTTGG GAGGCTTCCC CTCACTATCC TACCCCACCC GCTCCTTGAA GGGCCCAGAT 420 

i TCTGACCACG ACGAGCAGCA GTTACAAAAA CCTTCCCCAG GCTGGACGTG GTGGCTCAGC 480 

CTTGTAATCC CAGCACTTTG GGAGGCCAAG GTGGGTGGAT CACTTGAGGT CAGGAGTTCG 540 

AGACAGCCTG GCCAACATGA TGAAACCCCA TGTGTACTAA AAATACAAAA AATTAGCCGG 600 

GCGTGGTAGC GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT 660 

GAACCCGGGA GCGGAGCTTG CAGTGAGCCG AGATCGCGCC ACTGCACTCC AGCCTGGGCG 720 

> ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA AAAAAAAAAA AAAAAATACA AAAATTAGCC 780 
GCGTGGTGGC CCACGCCTGT AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT 840 
GAACCCAGGA GGTGGAGGCT GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT 900 

j 

GACAAAGTGA GACTCCGTCA CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA 960 

AGAGCTTCTG AGGCGCTGCT TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG 1020 

> CCTTGGCTTT GCAAGGGCTC TGTGACAAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG 1080 
AAGGGAGGAA CACTGCACTC TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG 1140 
GCAAACATGA AAAATCGGGG 1160 

) 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

> (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOliOGY: linear 

(ii) MOLECaLE TYPE: peptide 

) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

> Met Arg lie Ser Ala Thr Leu Leu Cys Leu Leu Leu He Ala Ala Ala 
15 10 15 

Phe Ser He Gin Val Trp Ala Gin Pro Asp Gly Pro Asn Ala Ser Thr 
20 25 30 

) 

cys Cys Tyr Val Lys Lys Gin Lys He Pro Lys Arg Asn Leu Lys Ser 
35 40 45 

Tyr Arg Arg He Thr Ser Ser Arg Cys Pro Trp Glu Ala Val He Phe 
5 50 55 60 

Lys Thr Lys Lys Gly Met Glu Val Cys Arg Glu Ala His Gin Lys Trp 
65 70 75 80 

D Val Glu Glu Ala He Ala Tyr I^eu Asp Met Lys Thr Pro Thr Pro Lys 

85 90 95 

Pro 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) -TOPOLOGY: linear 



10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ACTGAAGCCA GCTCTCTCAC TCTCTTTCTC CACCATGAGG ATCTCTGCCA CGCTTCTGTG 60 

15 CCTGCTGCTC ATAGCCGCTG CTTTCAGCAT CCAAGTGTGG GCCCAACCAG ATGGGCCCAA 120 

TGCATCCACA TGCTGCTATG TCAAGAAACi VAAGATCCCC AAGAGGAATC TCAAGA6CTA 180 

CAGAAGGATC ACCAGTAGTC GGTGTCCCTG GGAAGCTGTT ATCTTCAAGA CAAAGAAGGG 240 

20 

CATGGAAGTC TGTCGTGAAG CCCATCAGAA GTGGGTCGAG GAGGCTATAG CATACTTAGA 300 

CATGAAAACC CCAACTCCAA AGCCTTGAAG AAATGTGCCT GAACAGAAAC CAACCTAGGA 360 

25 GCCAAGAAGC AAAAATTCCT CACCGCTGTT CTTTCTGAGA ACTGTTGATG AAATGTGTTG 420 

ATCACGGTCC TAAGGGATAG GAGCTGTCTG TAGGAATGTG AAACAGTCAC GCCTAAGGAA 480 

TGGTCTTTAA GTTATTAATA TTTTTATTTA ATTAGCCATG TACTTTGGTG TGATTTGAAT 540 

30 

GTAAAGCTCT GGAGACCTCA TGTCACTTTA ACATTGTGTT AGCTGCAGAA TTC 593 
(2) INFORMATION FOR SEQ ID NO: 40: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii> MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID ?JO:40: 

Asp Ser Val Ser lie Phe lie Thr Cys Cys Phe Asn Val lie Asn Arg 
15 10 15 

Lys lie Pro lie Gin Arg Leu Glu Ser Tyr Thr Arg lie Thr Asn lie 
20 25 30 

Gin Cys Pro Lys Glu Ala Val lie Phe Lys Thr Gly Lys Glu Val Cys 
35 40 45 

Ala Asp Pro Lys Glu Arg Trp Val Arg Asp Ser Met Lys His Lys Asp 
50 55 60 



(2) INFORMATION FOR SEQ ID NO:41: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



5 (Xi) SEOaENCE DESCRIPTION: SEQ ID NO: 41: 

Asn Leu Ala Lys Gly Lys Glu Glu Ser Leu Asp Ser Asp Leu Cys 
15 10 15 

10 (2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Cys Asn Gin Val Glu Val lie Ala Thr Leu Lys Asp Gly Arg Lys lie 
25 1 5 10 15 

Cys Leu Asp Pro Asp Ala Pro Arg lie Lys Lys lie Val Gin Lys Lys 
20 25 30 

30 (2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TGCAACCAAG TCGAAGTGAT AGCCACACTG AAGGATGGGA GGAAAATCTG CCTGGACCCA 
GATGCTCCCA GAATCAAGAA AATTGTACAG AAAAAA 



60 
96 
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CLAIMS 

We claim: 

1 . A method of screening for AHA compounds comprising the steps of. 

a) contacting a compound with radiolabeled heparinrtieparan sulfate and 
5 heparanase; 

b) maintaining the compounds in contact with the radiolabeled heparinAieparan 
sulfate and heparanase for a time and under such conditions sufficient to aUow 
inhibition of heparanase activity; 

c) detecting inhibition of heparanase activity (a compound that gives 50% 
10 inhibition at a conceaitration of 1 pM or less); and 

d) selecting compounds that inhibit heparanase activity. 

2. A method according to claim 1 wherein the heparanase is recombinant. 

15 3. A heparanase havmg an isoelectric point of less than 5.5 and possessing activity greater 
than 20 imits heparanase activity per pg protein. 

4. A heparanase according to Qahn 3, having an isoelectric point of about 4.8 - 5.1. 

20 5. A heparanase purified to apparent homogeneity, as in claim 3. pr^ared in the presence of 
teducmg conditions and activated with transghitammase. having an amino acid sequence selected 
from the group consisting of SEQ. ID NO: 1. SEQ. ID NO: 3, SEQ. ID NO: 5 or SEQ. ID Na 



25 6. A h^aranase purified to apparent homogeneity, as in claim 3, prepared in the presence of 
reducing conditions, having &r amino acid sequence of SEQ. ID Na. 1. 

7. A heparanase purified to apparent homogeneity, as in claim 3. pr^ared in the presence of 
reducing conditions and activated with transglutaminase, having an amino acid sequence of 
30 SEQ. ID NO: 3. 

&. A heparanase according to Qaun 4, purified to apparent homogeneity, prepared in the 
presence of reducing conditions and activated with transglutaminase. 

35 9. A heparanase. as in claim 3. prepared by recombinant means, activated ^-ith 

transglutaminase and having an ammo acid sequence selected from the group consisting of SEQ. 
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ID NO: 1. SEQ. ID NO: 3. SEQ. ID NO: 5 or SEQ. ID NO: 7. 

10. A h^aranase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 1 . 

5 

11. A heparanase, as in claim 3, prepared by recombinant means, activated with 
transglutaminase and having an amino acid sequence of SEQ. ID NO: 3. 

12. A heparanase purified to apparent homogeneity, as in claim 3, prepared in the presence 
10 of reducing conditions and activated with transglutaminase, and having an amino acid sequence 

selected from the group consisting of SEQ. ID. NO: 12. SEQ. ID. NO: 14; SEQ. ID. NO: 16, 
SEQ. ID. NO: 18, SEQ. ID. NO: 20, SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 

13. A h^aranase purified to appaieat homogeneity, as in claim 3, prepared in the presence 
15 of reducing conditions and activated with transglutaminase, and having an amino acid sequence 

selected from the group consisting of SEQ. ID. NO: 26. SEQ. ID. NO: 28, SEQ. ID. NO: 30, 
SEQ. ID. NO: 32; SEQ. ID. NO: 34. SEQ. ID. NO: 36. SEQ. ID. NO: 38 and SEQ.ID. NO: 40. 

14. A method according to claim 1 wherein the heparanase is purified to apparent 
20 homogeneity, prepared in the presence of reducing conditions and activated with 

transglutaminase, and having an amino acid sequ^ce selected from the grot^ consisting of 
SEQ. ID. NO: 12, SEQ. ID. NO: 14; SEQ. ID. NO: 16, SEQ. ID. NO: 18. SEQ. ID. NO: 20, 
SEQ. ID. NO: 22 and SEQ. ID. NO: 24. 

25 15. A method according to claim 1 whra«in the heparanase is purified to apparent 
homog^eity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID. NO: 26, SEQ. ID. NO: 28. SEQ. ID. NO: 30, SEQ. ID. NO: 32; SEQ. ID. NO: 34. 
SEQ. ID. NO: 36, SEQ. ID. NO: 38 and SEQ. ID. NO: 40. 

30 

16. A method according to claim 1 wherein the heparanase is purified to apparent 
homogeneity, prepared in the presence of reducing conditions and activated with 
transglutaminase, and having an amino acid sequence selected from the group consisting of 
SEQ. ID NO: 1. SEQ. ID NO: 3, SEQ. ID Ntt 5 or SEQ. ID NO: 7. 

35 

17. A peptide having an amino acid sequence of SEQ. ID. NO: 42. 
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